期刊名称:International Journal of Innovative Research in Computer and Communication Engineering
印刷版ISSN:2320-9798
电子版ISSN:2320-9801
出版年度:2017
卷号:5
期号:4
页码:8368
DOI:10.15680/IJIRCCE.2017.05040348
出版社:S&S Publications
摘要:The term ‘Big data’ refers to the large volumes of structured and unstructured data or the complex datasets that cannot be handled by using a traditional data processing approach. While working with Big Data, it’s not theamount of data that matters but the quality of information that can be extracted from the database. In an organization,‘Big data’ is evaluated for insights that direct to better strategic decisions. Advanced data analytics techniques likepredictive analytics, location intelligence, and data mining are used to process hundreds of terabytes of data forfinancial decision making or business informatics. To manage these large data sets (called ‘Big data’) Hadoop andMapReduce can be used. Hadoop is an open source framework that follows the distributed computing and parallelprocessing approach for the efficient and cost- effective processing of data sets. Another feature of Hadoop Model thatis beneficial for ‘big data’ is its scalability. It can scale up from a single server to large clusters of commodity servers,with a very high degree of fault tolerance.