Skip to content

Latest commit

 

History

History
73 lines (55 loc) · 2.73 KB

File metadata and controls

73 lines (55 loc) · 2.73 KB

🧼 Data Analysis & Imputation Project

This project provides a complete workflow for cleaning, imputing, evaluating, sorting, and visualizing datasets. It is designed to help data analysts explore and prepare data for deeper insights and modeling.

🔧 Features

  • Imputation – Handling missing values using statistical and model-based methods.
  • 📊 Evaluation – Comparing imputation results and measuring their effectiveness.
  • 🧽 Cleaning – Removing duplicates, fixing inconsistencies, and filtering out noisy data.
  • 🔍 Sorting & Filtering – Organizing data for clearer analysis.
  • 📈 Visualization – Generating informative charts using Matplotlib/Seaborn to identify trends.
  • 🧑‍💻 Streamlit UI (optional) – Interactive interface to explore data visually.

🧰 Install dependencies:

download dataset



if an error occur for the bokeh library:

pip install --force-reinstall --no-deps bokeh==2.4.3

do not forget:

pip install -r requirements.txt 

🧪 Technologies Used

  • python
  • streamlit
  • pandas
  • numpy
  • sklearn
  • matplotlib
  • bokeh
  • altair
  • plotly
  • io

🏕️pic

da1 da2 da3 da4 da5 da6 da7 da8 da9 da10 da11 da12 da13 da14

📌 Purpose

This project is ideal for practicing:

Real-world data preprocessing

Missing value treatment

Evaluation of imputation strategies

Clear and clean visualization for storytelling