Healthcare Data Pipeline

Please use this identifier to cite or link to this item: http://www.ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935

Full metadata record

DC Field	Value	Language
dc.contributor.author	Dabral, Shivam	-
dc.contributor.author	Mohana, Rajni [Guided by]	-
dc.date.accessioned	2023-09-12T12:31:26Z	-
dc.date.available	2023-09-12T12:31:26Z	-
dc.date.issued	2023	-
dc.identifier.uri	http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/9935	-
dc.description	Enrolment No. 191273	en_US
dc.description.abstract	Big Data Processing is a matter of interest for many companies around the globe as they try to harness the true power of data. Similarly Nference labs private limited is trying to make use of healthcare data to provide people with better medical support. This project aims at exploring such various techniques that employ engines and frameworks that can generate useful data from raw data effectively and efficiently. Various techniques were examined based upon many research papers and compared. The results suggested the use of Apache Spark as an engine for computation. The data files were stored in parquet format with snappy compression, so that data occupies less space. Hence the aim was to come up with an efficient data generation pipeline that can handle Terabytes of data.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Jaypee University of Information Technology, Solan, H.P.	en_US
dc.subject	Python programming	en_US
dc.subject	Apache spark	en_US
dc.subject	Pseudocode	en_US
dc.subject	Healthcare	en_US
dc.title	Healthcare Data Pipeline	en_US
dc.type	Project Report	en_US
Appears in Collections:	B.Tech. Project Reports

Files in This Item:

File	Description	Size	Format
Healthcare Data Pipeline.pdf		1.64 MB	Adobe PDF	View/Open