MSc. Data Science & Computational Intelligence

Yashodeep
Basnet

01

Research & Projects

Production-style projects focused on data pipelines, information retrieval, and machine learning systems.

P01 ETL · PostgreSQL

Retail Sales ETL Pipeline

Automated ETL pipeline ingesting multi-format data sources (CSV, Excel, JSON), applying normalization and cleaning transforms, and loading into a PostgreSQL schema optimized for downstream analytics and dashboards.

PythonPandasPostgreSQLDocker
P02 IR · TF-IDF

Coventry Academic Search Engine

Vertical search engine crawling academic publications to build an inverted index, ranking results via weighted TF-IDF scoring. Results surfaced through a RESTful Flask API supporting structured queries.

PythonWeb CrawlingInverted IndexFlask
P03 NLP · Classification

Automated News Classifier

Text classification pipeline that crawls news articles and categorises them into Business, Entertainment, and Health domains. Achieved a macro F1-score exceeding 0.97 across all classes using a scalable preprocessing pipeline.

PythonScikit-learnNaive BayesCrawling
P04 ML · API Deployment

Fake News Detection API

Processed a corpus of 45,000+ articles with NLP cleaning and feature engineering. Trained and evaluated multiple classifiers; deployed the best-performing model as a real-time inference API.

PythonNLTKScikit-learnFlask
02

Technical Competencies

Core skills developed through machine learning projects, research, and production-level systems.

Machine Learning & AI

  • Supervised learning (XGBoost, Random Forest)
  • Deep Learning & Reinforcement Learning (DDQN)
  • Explainable AI (SHAP)
  • LLMs, Prompt Engineering & Ollama

Data Science & Analytics

  • Data analysis & statistical testing (Wilcoxon, Kendall’s W)
  • Feature engineering & model evaluation
  • Data preprocessing & validation
  • Analytical pipelines & experimentation

Engineering & Deployment

  • FastAPI (ML inference APIs)
  • Docker (reproducible environments)
  • ETL pipelines & Apache Spark
  • Airflow & Medallion Architecture

Databases & Systems

  • PostgreSQL, MongoDB, Neo4j
  • SQL (analytics & optimisation)
  • Data modelling & schema design
  • Distributed data systems fundamentals
03

Education

Academic background aligned with data science and computational intelligence.

2023 — Present

MSc. Data Science & Computational Intelligence

Softwarica College · Coventry University, UK

Focus areas: machine learning, data science, analytics, and distributed data processing.

2019 — 2023

BSc. (Hons) Software Engineering — GPA 4.0 / 4.0

NAMI College · University of Northampton, UK

Strong foundation in backend systems, relational databases, algorithms and software architecture.

04

Contact

Happy to discuss projects, answer questions, or arrange a demo.