Prediction And Visualization Of Missing Data By Using Data Analytics
DOI:
https://doi.org/10.47750/pnr.2022.13.S09.1072Abstract
The main objective of the proposed work is to deal with data loss in a database as it is well known that data is the most valuable thing for every organization. Loss of data or improper entry of data leads to improper databases which makes maintaining such databases useless. So, the aim is to create user- friendly databases that are more efficient to store and retrieve, along with data prediction, interpretation or data visualization. The proposed work involves machine learning, data analytics and database management. Machine learning algorithms such as random forest classifier and support vector machine are used to predict the lost data and linear regression, logistic regression are used to visualize the data to the user in an understandable manner. Database management is used to create and store the data efficiently. Here, oracle database is taken into consideration and it is chosen as it is one of the popular databases around the world and it is easy for the user to maintain. Data analytics can be performed using python and the missing values in the database can be predicted by analyzing the past trends. The proposed work could be used to effectively predict salaries of the employees, stock price prediction etc.