Please use this identifier to cite or link to this item: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/6590
Title: Web Crawling Based Data Consolidation
Authors: Bajpai, Shubham
Singh, Sanjana [Guided by]
Keywords: Web crawling
Data consolidation
Issue Date: 2015
Publisher: Jaypee University of Information Technology, Solan, H.P.
Abstract: While planning a software project, one faces various difficulties and needs guidance; the obvious choice in such situations is to look up blog sites that can give some insight into the problem. Today, very large amounts of information are available in online documents. As part of the effort to better organize this information for users, researchers have been actively investigating the problem of automatic text categorization. Big data is an evolving term that describes any voluminous amount of structured, semi-structured, and unstructured data that has the potential to be mined for information. The bulk of such work has focused on topical categorization, attempting to sort documents according to their subject matter (e.g., sports vs. politics). Contemporary electronic commerce involves everything from ordering "digital" content for immediate online consumption, to ordering conventional goods and services, to "meta" services that facilitate other types of electronic commerce. India had an internet user base of about 250.2 million as of June 2014. The penetration of e-commerce is low compared to markets like the United States and the United Kingdom, but it is growing at a much faster rate, with a large number of new entrants.
URI: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/6590
Appears in Collections: B.Tech. Project Reports

Files in This Item:
File: Web Crawling Based Data Consolidation.pdf
Size: 3.84 MB
Format: Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.