Data Mining is the process where we can extract useful information from Huge Data. Lets us understand the term mining which means that we dig the soil and extract something same to data mining work. In Data Mining we have a bulk of data and this data is sometimes available in raw form from that data you extract the useful information that you want after analyzing. The term analysis means you have data and you want to select only the 20% from that data then you can analyze that data and extract useful information.
In another way, you can call data mining knowledge mining because if we take it in the field of science then we can extract the data from the huge knowledge. If we talk about the present era data mining is used in every field where we have a huge amount of data. We can apply data mining to different data which include:
- Transactional Databases
- Relational Databases
- World Wide Web
- Spatial Databases
- Data Warehouse
The data mining is consist of three steps which is:
- Data Pre-Processing
- Data Extraction
- Data Evaluation
1 Data Pre-Processing
In the data pre-processing step we collect the data from different sources. When we collect the data then we start pre-processing that data. In Data Pre-Processing we clean the data if the data has duplicate values we remove duplications. If we have data available in the table then we can check that table whether has a duplicate value if has then removed it. Each row much be unique, and every column contains a single value.
2 Data Extraction
3 Data Evaluation
Applications
- Fraud Detection
- Research Analysis
- Biological Analysis
- Healthcare
- CRM
- Manufacturing Engineering
- Market Basket Analysis
- Lie detection
- Education
- Financial Analysis
Data Mining Types
- Relational Database
- Data Repositories
- Data Warehouse
- Object-Relational Database
- Transactional Database
1 Relational Database
2 Data Warehouse
3 Data Repository
Benefits of Data Mining
- If we use data mining then we can save a lot of costs so we can say that this is cost-effective.
- If we use data mining then we can easily take a decision because when we apply data mining we get analyzed data based on analyzed data we take action.
- If we use data mining we can add this to the already available system and also we can include it in the new one which is the best advantage of data mining.
- Nowadays we are looking for a system that works fast so our wait is over data mining process the data very fast and provide quick result.
- It also provides useful mean knowledge-based information that is very important for any organization.
- Data mining plays an important role in the finding of hidden patterns meaning those data that have not been accessed by anyone are discovered by data mining.
- Data mining also analyzes the behavior.
The drawback of Data Mining
Text Mining
Text mining is the process of extraction of useful text from articles/documents. We can also call it information mining. In text mining, we have unstructured data and we apply mining to that data and extract the useful information and make that data structured. We can use text mining for extracting relevant text or words that are available in the data.
When we start text mining the data we apply clustering to the data. If you don't know what is clustering then read this article you will understand. In clustering, we separate similar words and make a cluster of that words.
In text mining, we have a lot of Unstructured Data available on Google and many other Platforms.
We can use that data for Text Mining and extracting useful information.
In the text, we have data available in three formats.
- Structured data
- Semi-Structured data
- Unstructured data



0 Comments