Data Analyst · Data Scientist · ML Engineer MSc Data Science · University of Naples Federico II · Graduating October 2026
LinkedIn · HuggingFace · parhamkhoshsolat@gmail.com
I ship machine learning end to end. Three of my projects are live as interactive apps on HuggingFace Spaces. One worked on real proprietary data with Procter & Gamble Italy. My MSc thesis is on transparency in human-robot reinforcement learning, extending published work from Federico II.
| Project | Stack | What it does |
|---|---|---|
| Florence-2 VQA | PyTorch, HuggingFace Transformers, Streamlit | Fine-tuned Microsoft Florence-2 (771M params) for visual question answering on a 150K image-question VQA v2.0 dataset. Live demo |
| Pest Population Forecasting | Scikit-learn, XGBoost, LightGBM, Streamlit | Benchmarked 11 models on noisy multi-source sensor data. Random Forest topped both regression and classification with 100 percent recall on the minority pest class. Live demo |
| Stock Clustering Pipeline | Apache Kafka, PySpark, scikit-learn | Real-time pipeline streaming 55 US equities through Kafka topics, with K-means + PCA portfolio segmentation |
| Retail Geospatial Analytics | GeoPandas, Folium, Python, SQL | Industry analytics challenge with Fater S.p.A. (Procter & Gamble × Angelini Industries). Joined proprietary sales data with ISTAT census across all 20 Italian regions. Presented to leadership; received individual recognition. |
| OULAD Time-Series Forecasting | TensorFlow / Keras, Statsmodels, Prophet | Benchmarked SARIMA, ARIMAX, Prophet, and a custom 1D CNN on student interaction data. CNN won on MAE and MAPE. |
| TalentSonar | Gemini API, Streamlit, GitHub GraphQL | Apple Developer Academy 24-hour hackathon. Automated candidate-to-job matching using LLMs. |
Languages: Python, SQL, JavaScript, Bash ML / Deep Learning: PyTorch, HuggingFace Transformers, Scikit-learn, XGBoost, LightGBM, TensorFlow / Keras, Random Forest, LSTM, GRU, CNN, Fine-tuning, Transfer Learning Data Engineering: Apache Kafka, PySpark, ETL, REST APIs, GraphQL, PostgreSQL, MySQL Visualisation & BI: Power BI (DAX), Tableau, Plotly, Seaborn, Matplotlib, GeoPandas, Folium, Streamlit MLOps: Docker, HuggingFace Spaces, Git / GitHub workflows, A100 / GPU training
- MSc Data Science at Federico II (in progress, expected Oct 2026) · GPA 28.3/30 · 3× 30 e lode
- Apple Foundation Program · Federico II × Apple Developer Academy (Jan 2025)
- BSc Information Technology Engineering · Amol University, Iran (2017)
Working on my MSc thesis: research in human-robot interaction and reinforcement learning, extending published work from Federico II.
Open to Data Analyst, Data Scientist, ML Engineer, or AI Engineer roles starting now. Onsite Naples or remote across the EU.