Please use this identifier to cite or link to this item:
http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Dabral, Shivam | - |
dc.contributor.author | Mohana, Rajni [Guided by] | - |
dc.date.accessioned | 2023-09-12T12:31:26Z | - |
dc.date.available | 2023-09-12T12:31:26Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935 | - |
dc.description | Enrolment No. 191273 | en_US |
dc.description.abstract | Big Data Processing is a matter of interest for many companies around the globe as they try to harness the true power of data. Similarly Nference labs private limited is trying to make use of healthcare data to provide people with better medical support. This project aims at exploring such various techniques that employ engines and frameworks that can generate useful data from raw data effectively and efficiently. Various techniques were examined based upon many research papers and compared. The results suggested the use of Apache Spark as an engine for computation. The data files were stored in parquet format with snappy compression, so that data occupies less space. Hence the aim was to come up with an efficient data generation pipeline that can handle Terabytes of data. | en_US |
dc.language.iso | en_US | en_US |
dc.publisher | Jaypee University of Information Technology, Solan, H.P. | en_US |
dc.subject | Python programming | en_US |
dc.subject | Apache spark | en_US |
dc.subject | Pseudocode | en_US |
dc.subject | Healthcare | en_US |
dc.title | Healthcare Data Pipeline | en_US |
dc.type | Project Report | en_US |
Appears in Collections: | B.Tech. Project Reports |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Healthcare Data Pipeline.pdf | 1.64 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.