Please use this identifier to cite or link to this item: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935
Full metadata record
DC FieldValueLanguage
dc.contributor.authorDabral, Shivam-
dc.contributor.authorMohana, Rajni [Guided by]-
dc.date.accessioned2023-09-12T12:31:26Z-
dc.date.available2023-09-12T12:31:26Z-
dc.date.issued2023-
dc.identifier.urihttp://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935-
dc.descriptionEnrolment No. 191273en_US
dc.description.abstractBig Data Processing is a matter of interest for many companies around the globe as they try to harness the true power of data. Similarly Nference labs private limited is trying to make use of healthcare data to provide people with better medical support. This project aims at exploring such various techniques that employ engines and frameworks that can generate useful data from raw data effectively and efficiently. Various techniques were examined based upon many research papers and compared. The results suggested the use of Apache Spark as an engine for computation. The data files were stored in parquet format with snappy compression, so that data occupies less space. Hence the aim was to come up with an efficient data generation pipeline that can handle Terabytes of data.en_US
dc.language.isoen_USen_US
dc.publisherJaypee University of Information Technology, Solan, H.P.en_US
dc.subjectPython programmingen_US
dc.subjectApache sparken_US
dc.subjectPseudocodeen_US
dc.subjectHealthcareen_US
dc.titleHealthcare Data Pipelineen_US
dc.typeProject Reporten_US
Appears in Collections:B.Tech. Project Reports

Files in This Item:
File Description SizeFormat 
Healthcare Data Pipeline.pdf1.64 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.