- Jamboree is a renowned education consultancy that has assisted thousands of students in achieving their dreams of studying at top colleges abroad. Specializing in test preparation for GMAT, GRE, and SAT, Jamboree's unique problem-solving methods ensure that students attain maximum scores with minimal effort.
- Recently, Jamboree launched an innovative feature on their website that allows students to check their probability of gaining admission to Ivy League colleges. This feature, tailored specifically for students from India, offers personalized insights into their chances of securing a spot in prestigious graduate programs, further solidifying Jamboree's commitment to supporting students in their academic journeys.
- Identify Key Admission Factors: Determine which variables (e.g. CGPA, GRE Scores, LOR, SOP, Research Experience) significantly impact the probability of admission to Ivy League colleges.
- Understand Interrelationships: Analyze how these factors interact with each other and their combined effect on admission chances.
- Improve Predictive Accuracy: Develop and refine predictive models to accurately estimate a student’s chance of admission based on the identified factors.
- Please check the dataset at: "Dataset Link"
- Serial No. (Unique row ID)
- GRE Scores (out of 340)
- TOEFL Scores (out of 120)
- University Rating (out of 5)
- Statement of Purpose and Letter of Recommendation Strength (out of 5)
- Undergraduate GPA(CGPA) (out of 10)
- Research Experience (either 0 or 1)
- Chance of Admit (ranging from 0 to 1)
- Observations
- Data Shape and Types
- Missing Values
- Statistical Summary
- Univariate Analysis
- Bivariate Analysis
- Duplicate Value Check:
- Outlier Treatment:
- Linear Regression
- Ridge and Lasso Regression
- Calculate the Variance Inflation Factor (VIF) and iteratively drop variables with VIF > 5.
- Ensure that the mean of residuals is close to zero.
- Check for patterns in the residual plot to confirm linearity.
- Verify that the residuals exhibit constant variance.
- Assess if the residuals follow a normal distribution using Q-Q plots and distribution plots.
- Evaluate model performance using metrics such as MAE, RMSE, R^2, and Adjusted R^2.
- Compare performance on training and testing datasets.
- Highlight the most significant predictors
- Discuss potential improvements based on performance metrics.
- Actionable items for implementation of the model in real-world scenarios and the potential business benefits.