Skip to content

Latest commit

Β 

History

History
51 lines (36 loc) Β· 1.49 KB

File metadata and controls

51 lines (36 loc) Β· 1.49 KB

πŸŽ“ Student Performance Prediction (UCI Dataset)

This machine learning project predicts student performance based on various academic, demographic, and behavioral factors. The goal is to classify whether a student is likely to pass or fail using supervised classification techniques.

πŸ“‚ Dataset

πŸ“Š Features

The dataset includes:

  • Demographic info (e.g., sex, age, address)
  • Family and social info (e.g., famsize, Pstatus, schoolsup)
  • Academic info (e.g., studytime, failures, absences, grades G1, G2, G3)

Target variable:

  • pass β†’ Created as 1 if G3 >= 10, otherwise 0

βœ… Project Steps

  • Data Loading and Preprocessing
  • Feature Engineering
  • Encoding categorical variables
  • Train/Test Split
  • Model training (Random Forest Classifier)
  • Model evaluation (Accuracy, Classification Report)
  • Visualizations:
    • Grade distribution
    • Correlation heatmap
    • Study time vs Pass
    • Feature importance

🧠 ML Model Used

  • RandomForestClassifier (from sklearn.ensemble)
  • Accuracy achieved: XX% (fill with your model score)

πŸ“ˆ Visualizations

  • Histogram of grades
  • Heatmap of feature correlations
  • Boxplots of study time vs pass/fail
  • Barplot of feature importances

πŸ§ͺ Requirements

Install required packages:

pip install -r requirements.txt