Category : M.Tech Thesis
K-Nearest neighbour classifiers are defined by their characteristic of classifying unlabelled examples by assigning them the class of the most similar labelled examples. Despite the simplicity of this idea, nearest neighbour methods are extremely powerful. They have been used successfully for: • Computer vision applications, including optical character recognition and facial recognition in both still images and video • Predicting whether a person enjoys a movie which he/she has been recommended (as in the Netflix challenge) • Identifying patterns in genetic data, for use in detecting specific proteins or diseases
Human vision is more sensitive to colour than Gray levels. Therefore, colour image processing is important, although it requires more memory to store and longer execution times to process. There are different colour models, and each one is suitable for some application. In the RGB model, a colour image is expressed in terms of the intensities of its red, green, and blue components. In the HSI model, the intensity component is separated from the colour components.
The data scientists at big mart have collected sales data of 2013 for a number of products across 10 stores in different cities. In this study they have analysed the attributes of each product and store and then build a model which finds out the sale of each product at a particular store.
Image compression has been minimising the size in bytes of a graphics file without degrading the quality of the image to an unacceptable level. The reduction in file size allows more images to be stored in a given amount of disk or memory space. It also reduces the time required for images to be sent over the Internet or downloaded from Web pages.
The growth of technology such as WWW, Social networking, Internet of things (IoT), electronic media etc. are responsible for the generation of vast amount of data in our daily routine by interacting with Internet world, Social networking media etc. across the world. It is very important to analyse and understand this unstructured datasets. So, Sentimental analysis is one of the technologies which is used for determining whether the given piece of writing is positive, negative or neutral. It is also known as opinion mining.
Streaming data analysis has attracted attention in various applications like financial records, data analysis, etc. Such type of applications require continuous storage of large amount of data in data warehouse while simultaneously providing quick response time for the queries against the data that is stored in the system. The duration of fetching data varies depending on type of data required from the system. This article presents the performance estimates in terms of MySQL Partition, Hive partition-bucketing and Apache Pig framework. In this article, big data Eco systems and comparative performance analysis of frequently used data retrieval techniques such as MySQL, Hive and Pig are described. From the work presented in the article, it is concluded that the execution time for extracting data becomes very large with growth in data size, particularly in case of MySQL. As compared to MySQL, Hive and Pig takes less time and give better results.
25.07.2018 0 Comments 1269
16.08.2018 0 Comments 1260
13.08.2018 0 Comments 614
03.08.2018 0 Comments 594
24.07.2018 0 Comments 586
27.12.2018 0 Comments 579