The world we that we see today have automated data collection tools, databases systems, world wide web, and computerized society. This results in an explosive growth in data, from terabytes to petabytes.
We are drowning in the ocean of data but starving for knowledge.
A huge velocity, volume, and variety of data are what our new age has provided us. We have cheaper technology, mobile computing, social networking, Cloud computing which has evoked this data storm.
These are the reasons why conventional methods fade away and we need some novel methods like Data mining to process the new era of data.
Data mining is an iterative and interactive process of discovering novel, valid, useful, and understandable patterns and models from massive data sources.
Breaking down the definition of data mining.
The overall process of generating knowledge from massive databases is called KD. It is a more complex process than DM. DM is a step of KD which deals with the identification of patterns in the data.
Let us breakdown the process of KD.
We should have prior knowledge of the application areas where we are going to discover the knowledge. It is observed that having prior knowledge helps the better generation of insights from the data.
Once we have obtained the data from warehouses, we need to remove the noise and the inconsistent data. It may take up to 60% effort in the knowledge discovery process.
#ai #data-science #data-mining #machine-learning