A curated collection of AI, data engineering, and DevOps projects featuring real-world applications, advanced techniques, and tutorials—ideal for learners and practitioners exploring data science and machine learning.

MelihGulum/Comprehensive-Data-Science-AI-Project-Portfolio

Comprehensive Data Science & AI Project Portfolio

A curated collection of projects across Machine Learning, Deep Learning, Data Engineering, Data Analysis, and Cloud/MLOps.

This repository showcases end-to-end workflows built on real-world datasets, with deployment-ready pipelines and reproducible practices.

Each project folder contains a dedicated README with setup instructions, methodology, and results.

Built with tools and frameworks widely used in industry:

Python • Scikit-Learn • TensorFlow
SQL • Kafka • Docker • Terraform
Flask • AWS • GCP

Featured Projects

A selection of the most impactful and advanced projects:

| Project | Domain | Key Tools |
|---|---|---|
| Urban Sound Classification & Deployment | Audio AI | CNN, Optuna, Flask |
| Face Mask Detection (Real-Time) | Computer Vision | MobileNetV2, Transfer Learning |
| Urban Sound Research Project | Audio ML/DL | CNN, LSTM, A/B Testing |
| NBA Player Stats ETL Pipeline | Data Engineering | Web Scraping, MSSQL |
| Terraform Fundamentals | Cloud IaC | Terraform |

Machine Learning Projects

View Machine Learning Projects
| Project Name | Task | Prominent Techniques / Tools |
|---|---|---|
| Diabetes Classification | Classification | Baseline ML Pipeline |
| Heart Attack Prediction | Classification | End-to-End Workflow |
| Medical Cost Prediction | Regression | Optuna, SHAP |
| Melbourne House Price Prediction | Regression | Permutation Importance, SelectFromModel |
| Clustering Techniques | Clustering | Silhouette, Davies-Bouldin, K-Elbow |
| Airline Customer Satisfaction | Classification | Optuna, Feature Selection, SHAP |
| Forecasting USD-TRY Exchange Rates | Time Series | ARIMA, SARIMA, STL, Statistical Tests |

Deep Learning Projects

View Deep Learning Projects (10)
| Project Name | Task | Prominent Techniques / Tools |
|---|---|---|
| Face Mask Detection | Computer Vision | MobileNetV2, Transfer Learning |
| Gender Detection | Computer Vision | CNN, OpenCV |
| Email Spam Detection | NLP | Naive Bayes, SVM, WordCloud |
| Music Genre Classification | Audio Processing | Librosa, CNN/LSTM, Flask |
| ASL Recognition | Computer Vision | CNN, OpenCV |
| CIFAR-10 Deployment | Computer Vision | Augmentation, Flask |
| Sentiment & Spam Classification | NLP | LSTM, Flask, MySQL |
| Face Emotion Recognition | Computer Vision | FER2013, MySQL, Real-Time |
| Urban Sound Classification | Audio AI | Optuna, Flask Deployment |
| Urban Sound Research Project | ML + DL Audio | CNN, LSTM, A/B Testing |

Data Engineering Projects

View Data Engineering Projects
| Project Name | Task | Prominent Techniques / Tools |
|---|---|---|
| NBA Player Stats | ETL | Web Scraping, MSSQL, Logging |
| Real-Time & Batch Pipelines with Kafka | Streaming ETL | Kafka, PostgreSQL, Docker |
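
For readers new to the extract, transform, load pattern with logging that the projects above are built around, here is a minimal sketch. It is not code from this repository: the rows are made-up placeholders, the function and table names are hypothetical, and the standard-library `sqlite3` module stands in for MSSQL so the example runs without a database server.

```python
import logging
import sqlite3
from typing import Dict, List, Tuple

logging.basicConfig(level=logging.INFO, format="%(levelname)s %(message)s")
log = logging.getLogger("etl_sketch")


def extract() -> List[Dict[str, str]]:
    """Return raw rows; placeholder data standing in for scraped table cells."""
    return [
        {"player": " Player A ", "points": "21.5"},
        {"player": "Player B", "points": "n/a"},
        {"player": "Player C", "points": "18"},
    ]


def transform(rows: List[Dict[str, str]]) -> List[Tuple[str, float]]:
    """Trim names, cast points to float, and log rows that cannot be parsed."""
    clean = []
    for row in rows:
        try:
            clean.append((row["player"].strip(), float(row["points"])))
        except ValueError:
            log.warning("skipping row with bad points value: %r", row)
    return clean


def load(conn: sqlite3.Connection, records: List[Tuple[str, float]]) -> int:
    """Upsert records into a (hypothetical) player_stats table; return its row count."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS player_stats "
        "(player TEXT PRIMARY KEY, points REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO player_stats VALUES (?, ?)", records)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM player_stats").fetchone()[0]


conn = sqlite3.connect(":memory:")
loaded = load(conn, transform(extract()))
log.info("loaded %d rows", loaded)
```

Keeping the three stages as separate functions makes each one testable on its own, and the warning log leaves a record of every row that was dropped during cleaning.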

Data Analysis Projects

I. Exploratory Data Analysis (EDA)

View EDA Projects
| Project Name | Task | Prominent Techniques / Tools |
|---|---|---|
| Netflix Originals | Visualization | Simple EDA |
| Titanic | Feature Engineering | Survival Pattern Analysis |
| MovieLens | Visualization | Trend & Rating Analysis |
| Data Science Salary | Visualization | Job Market Insights |
| Heart Attack Analysis | In-depth EDA | Multivariate Exploration |
| HR Analytics | Preprocessing + EDA | End-to-End Data Cleaning |
| Glassdoor Jobs | Data Cleaning | Selenium + Plotly |
| Pokémon Dataset | Feature Engineering | Interactive Plotly Analysis |
| Auto EDA Benchmark | Benchmarking | AutoViz vs SweetViz vs Profiling |

II. SQL Projects

View SQL Projects
| Project Name | Level |
|---|---|
| Portfolio SQL Project | Beginner |
| Netflix SQL | Beginner |
| 8 Weeks SQL Challenge | Intermediate–Advanced |
| Hackerrank SQL | Beginner–Intermediate |

Tutorials

View Tutorials Collection
| Tutorial | Focus |
|---|---|
| Librosa Audio Analysis | Audio Feature Extraction |
| 15 Python Tips & Tricks | Efficient Python |
| Gentle Guide of Pandas | Data Manipulation |
| Complete Guide on NumPy | Numerical Computing |
| Mastering Cross-Validation | Model Evaluation |
| Ultimate Guide to Data Splitting | ML Workflow |
| Time Series Splitting Visualizations | Forecasting |
| SQL Tutorials | Querying & Optimization |
| Terraform Fundamentals | Infrastructure as Code |
| Mastering Docker | Containers & Deployment |
| Kubernetes Guide | DevOps Orchestration |
| Tutorial on Image Processing | Computer Vision & Image Processing Fundamentals |
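
To give a flavor of the data-splitting topics listed above, the sketch below shows an expanding-window split for time-ordered data. It is an illustrative assumption, not code from the tutorials: the function name `expanding_window_splits` and its parameters are hypothetical, and only the Python standard library is used.

```python
from typing import Iterator, List, Tuple


def expanding_window_splits(
    n_samples: int, n_splits: int, test_size: int
) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train_indices, test_indices) pairs in chronological order.

    Each fold trains on every observation before its test window, so the
    model is never evaluated on data older than what it was trained on.
    """
    if n_splits < 1 or test_size < 1:
        raise ValueError("n_splits and test_size must be positive")
    first_test_start = n_samples - n_splits * test_size
    if first_test_start < 1:
        raise ValueError("not enough samples for the requested folds")
    for fold in range(n_splits):
        test_start = first_test_start + fold * test_size
        train = list(range(0, test_start))
        test = list(range(test_start, test_start + test_size))
        yield train, test


for train_idx, test_idx in expanding_window_splits(n_samples=10, n_splits=3, test_size=2):
    print(f"train={train_idx} test={test_idx}")
```

Unlike a shuffled k-fold split, every training window here ends before its test window begins, which avoids leaking future observations into the model during evaluation.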

Cloud & DevOps

View Cloud & DevOps Projects
| Project Name | Task | Tool |
|---|---|---|
| Terraform Fundamentals | IaC | Terraform |
| Mastering Docker | Containerization | Docker |
| Kubernetes Guide | Modern DevOps | Kubernetes |

🗺 Roadmap

Planned expansions:

  • Transformer-based NLP projects
  • GANs and self-supervised learning use cases
  • Spark + Kafka large-scale streaming pipelines
  • Full MLOps deployment workflows (CI/CD + Kubernetes)
  • Advanced interpretability methods (LIME, PDP, ICE, TreeSHAP)
  • Distributed computing tutorials

🤝 Contributing

Contributions are welcome!

Feel free to:

  • Open issues
  • Submit pull requests
  • Suggest new project ideas
  • Improve documentation

📌 This repository is continuously growing as I expand my work across AI, ML engineering, and scalable deployment.
