Data Mining Tasks, Techniques, and Applications
Flexible Query Answering Systems pp Cite as. In this paper we propose the combined use of different methods to improve the data analysis process. This is obtained by combining inductive and deductive techniques. Inductive techniques are used for generating hypotheses from data whereas deductive techniques are used to derive knowledge and to verify hypotheses. In order to guide users in the the analysis process, we have developed a system which integrates deductive tools, data mining tools such as classification algorithms and features selection algorithms , visualization tools and tools for the easy manipulation of data sets. The system developed is currently used in a large project whose aim is the integration of information sources containing data concerning the socio-economic aspects of Calabria and the analysis of the integrated data.
Data Mining refers to a process by which patterns are extracted from data. Such patterns often provide insights into relationships that can be used to improve business decision making. Statistical data mining tools and techniques can be roughly grouped according to their use for clustering, classification, association, and prediction. Clustering refers to data mining tools and techniques by which a set of cases are placed into natural groupings based upon their measured characteristics. Since the number of characteristics is often large, a multivariate measure of similarity between cases needs to be employed. When looking for how to data mine, Statgraphics provides a number of methods for deriving clusters, including nearest neighbor, furthest neighbor, centroid, median, group average, Ward's method, and the method of K-Means. The results may be displayed as a dendrogram, a membership table, or an icicle plot.
Data mining is the process of discovering patterns in large data sets involving methods at the intersection of machine learning , statistics , and database systems. The term "data mining" is a misnomer , because the goal is the extraction of patterns and knowledge from large amounts of data, not the extraction mining of data itself. The book Data mining: Practical machine learning tools and techniques with Java  which covers mostly machine learning material was originally to be named just Practical machine learning , and the term data mining was only added for marketing reasons. The actual data mining task is the semi-automatic or automatic analysis of large quantities of data to extract previously unknown, interesting patterns such as groups of data records cluster analysis , unusual records anomaly detection , and dependencies association rule mining , sequential pattern mining. This usually involves using database techniques such as spatial indices.