Data Forge: Dataset Cleansing & Processing

🧹🔥 Clean. Transform. Prepare.

This round focuses on the backbone of every successful data solution—data preprocessing. Teams will work with raw, unstructured datasets and transform them into clean, usable formats ready for analysis and modeling.

Data Forge

🧹 Round Format

  • Dataset Provided: Raw and messy dataset
  • Challenge Type: Data Cleaning & Preprocessing

📝 Task

  • Identify inconsistencies, missing values, and outliers
  • Clean and preprocess the dataset effectively
  • Perform feature engineering where necessary
  • Prepare the dataset for further analysis or modeling

⚖️ Judging Criteria

  • Handling Missing/Outlier Data – Effectiveness in managing incomplete or abnormal data
  • Feature Engineering Quality – Relevance and usefulness of created features
  • Efficiency & Correctness – Accuracy and optimization of preprocessing steps
  • Data Readiness for Modeling – How well the dataset is prepared for downstream tasks

Participation Details

Team Size: 2

Entry Fee: ₹100

Location: To be announced

Refine the data, refine the outcome 🚀