Please use this identifier to cite or link to this item: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7377
Title: Normalising Macronic text into a uniform language
Authors: Nadda, Nimisha
Mohana, Rajni[Guided by]
Keywords: Macronic text
Uniform language
Issue Date: 2016
Publisher: Jaypee University of Information Technology, Solan, H.P.
Abstract: SMS are short-length text documents written in a informal style. SMS text processing is challenging because of multi-varied text composition in terms of language, vocabulary, style and quality. In this project, with the help of RapidMiner software tool we have tried to standardize SMS texts. We have worked on American English messages only. With the help of a slang dictionary, we corrected most of the word. In order to improve the efficiency of the system, we created a database to perform next word prediction from. We performed bigram on our corrected dataset, retrieved the previous values to the error, and from our prediction dataset predicted what possible words could be used. Our system gives an accuracy of about 96% and can be further improved.
URI: http://ir.juit.ac.in:8080/jspui/jspui/handle/123456789/7377
Appears in Collections:B.Tech. Project Reports

Files in This Item:
File Description SizeFormat 
Normalising Macronic text into a uniform language.pdf2.02 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.