Title :
Analysis of protein database for semantic similarity using map reduce — A survey
Author :
Nirmala, G. ; Dinakaran, K.
Author_Institution :
Dept. of Comput. Sci. & Eng., RMD Eng. Coll., Chennai, India
Abstract :
Big data analytics is challenging for storage and analysis of large data sets. This paper is to determine the semantic relationships among Gene Ontology (GO) terms which are the attributes of protein database. This is implemented in a prototype system called PROSIM, which is an User interface with protein data collection. Gene ontology term is a collection of GO graph as an attribute in the database. A proposed technique that determine the contexts of terms based on the concept of existence dependency by map reduce in hadoop. Map Reduce is the parallel-processing engine that allows Hadoop to process and store the large data sets in relatively short order. Determining the semantic similarities among GO terms there by analysis and learning the affected proteins was done. The intensity and risk factor was concluded as per the classification.
Keywords :
Big Data; biology computing; data analysis; database management systems; ontologies (artificial intelligence); parallel processing; proteins; Big Data analytics; GO graph; GO term; Hadoop; MapReduce; PROSIM; gene ontology; protein data collection; protein database; semantic similarity; user interface; Big data; Biomedical measurement; Databases; Diseases; Ontologies; Proteins; Semantics; Gene ontology (GO); Map Reduce; data sets; related terms; semantic similarity;
Conference_Titel :
Computer Communication and Systems, 2014 International Conference on
Print_ISBN :
978-1-4799-3671-7
DOI :
10.1109/ICCCS.2014.7068166