Assessfy Pvt. Ltd Moderate 5 milestones 100 marks

NSE Stock Sector Clustering using Unsupervised Learning

Target year: TE Sem 5-6 (Mini-Project-IIA/IIB) AICTE: 3 credits · ~75 hrs Bloom: Analyze MU CBCS: DS601/DS701 Mini-Project 2A/2B

Real-world project · AICTE-aligned · AI-graded · Audit-ready certificate

5
Milestones
0
Available mentors
0
Enrolled students
8
Core skills
About this project

Use 5-year price data from 200+ NSE-listed stocks. Compute return-correlation matrix. Apply K-means + hierarchical clustering + DBSCAN to discover sectoral groupings. Compare automatic clustering vs official NSE sector classifications. Find anomalous stocks (outliers).

Course Learning Outcomes (CLOs):
CLO1: Apply unsupervised learning to a real, noisy financial dataset.
CLO2: Compare clustering algorithms quantitatively.
CLO3: Communicate findings via visualizations (dendrograms, t-SNE plots).
CLO4: Use cluster-validity indices appropriately.
CLO5: Reason about clustering vs ground-truth labels.

Industry/societal relevance: Indian quant-fintech (WorldQuant India, Tower Research, Alphaster) hire data scientists with portfolio-clustering skills; placement-relevant.

Milestones
1. Data Acquisition
15 marks 5d
200 stocks × 5 yrs daily close → Pandas DataFrame. Compute daily log returns. Clean splits/bonus.
2. EDA + Correlation Matrix
15 marks 10d
Plot return distributions, correlation heatmap. Identify the most + least correlated pairs.
3. K-Means + Hierarchical
25 marks 14d
Run K-means (sweep k=2-15). Dendrogram. Compare cluster assignments visually.
4. DBSCAN + Outlier Detection
20 marks 14d
Tune ε + minPts. Identify outlier stocks (anomalous in correlation space). Investigate why.
5. Comparison Report
25 marks 18d
Adjusted Rand Index vs NSE-published sectors. t-SNE visualization. 8-page report.
Open internships using this project -->
Skills you'll learn
Python (Pandas + scikit-learn)Unsupervised Learning (K-meanshierarchicalDBSCAN)PCA + t-SNEFinancial returns analysisCluster validation (silhouetteARI)
Tools used
Python 3.11Pandasscikit-learnyfinance / nsepymatplotlib + seabornPlotlyJupyter NotebookGitHub
Prerequisites
Python intermediate; basic statistics (correlationvariance); intro to ML; finance basics
Available mentors

No mentors have signed up for this project yet.

Be the first to mentor
Share
You'll earn — Certificate (PDF)

AICTE-aligned Project Completion Certificate

A formal, audit-ready PDF certificate issued by Assessfy + your institute on successful completion. Includes AICTE credit hours, your evaluator's signature, and a QR code for third-party verification.

Certificate of Project Completion

This is to certify that

has successfully completed the project

NSE Stock Sector Clustering using Unsupervised Learning

Auto-issued on completion QR-verifiable
You'll earn — Digital Badge

Shareable LinkedIn / Resume Skill Badge

A compact, verifiable Open-Badges-2.0-compliant digital credential. Add to your LinkedIn profile, GitHub README, or resume in one click. Recruiters can validate authenticity via a unique URL.

Moderate
NSE Stock Sector Clustering using Uns…
Assessfy
Auto-issued on completion One-click LinkedIn add

Similar Projects you might like

Hand-picked by the recommender from your program & skill area.

Relevant Certifications to boost your application

From the Assessfy Certification library — take one and add it to your resume / LinkedIn before applying.

100 marks AICTE 3cr
Sign up & enroll