AI Career Trends Dashboard - Project Documentation
Project Overview
This project predicts emerging careers in the AI era by analyzing daily technology trends and discussion data.
It combines data ingestion, natural language processing, and clustering techniques to surface actionable insights.
- Goal: Identify emerging AI-era career paths.
- Approach:
- Collect daily trend data from sources like Hacker News.
- Preprocess and embed text using Transformer models.
- Cluster trends using HDBSCAN.
- Map clusters to potential careers and aggregate.
- Outcome: A dynamic dashboard showing ranked AI career predictions.
Project Structure
ai_career_predictor/
├── fetch_data.py
├── predict_now.py
├── app.py
├── venv39/
├── ai_trends/
│ ├── templates/
│ │ └── index.html
│ └── trends.db
├── readme.html
SQLite Tables
trends: Raw trend data collected daily
predictions: Aggregated AI career predictions
Python Libraries Used
- requests, beautifulsoup4 → Data collection
- numpy, sentence-transformers, torch, hdbscan, scikit-learn → ML & embeddings
- flask, werkzeug.middleware.proxy_fix → Web app & API
AWS Deployment
- EC2: Runs Python virtual environment, hosts Flask API.
- ALB: Routes
/api/* requests to EC2, security group allows CloudFront.
- S3: Serves static
index.html.
- CloudFront:
- Default behavior → S3 static files
- /api/* behavior → ALB
- HTTPS enforced for clients
- Route53: Subdomain
ai.hankshi.com points to CloudFront
Deployment Steps
- Set up Python 3.9 virtual environment and install dependencies.
- Run
fetch_data.py to populate SQLite DB.
- Start Flask app on EC2.
- Configure CloudFront behaviors:
* → S3, /api/* → ALB.
- Invalidate CloudFront cache after updates.
- Access dashboard via https://ai.hankshi.com
Importance & Perspective
- Helps individuals anticipate AI-driven job changes and upskill.
- Provides organizations with trend intelligence for workforce planning.
- Future extensions: automated forecasting, additional sources, visualization dashboards, alerts.
- Demonstrates cost-effective AI pipelines on AWS.