Hello and Welcome!

Gaganjot Shan

I’m Gaganjot Shan, an AI Engineer with over two years of experience in NLP, large language models, and data-driven application development. I love solving complex problems and crafting scalable and sustainable solutions. I am really passionate about environment sustainiability, knowledge sharing and continuous growth

My unique background includes three years as a research scholar at the Indian Institute of Science Education and Research (IISER, Trivandrum), where I built a strong foundation in Biology, Chemistry, Physics, and Mathematics before transitioning to Computer Science. I enjoy integrating this multidisciplinary experience into my data science work. I thrive on tackling tough problems and am always eager to learn something new.

My goal is to contribute effectively to meaningful projects that drive efficiency and positive impact. Take a look around to explore my portfolio and discover how I bring my skills and passion to every endeavor and feel free to check out my resume!

Skills!

Machine Learning & Data Science

Python Machine Learning Deep Learning TensorFlow PyTorch Natural Language Processing (NLP) Generative and Agentic AI Data Analysis Predictive Modelling Statistical Modelling Data Visualization SQL and NoSQL Databases

Web Development

HTML5 CSS3 JavaScript Front-End Development

Tools & Platforms

PowerBI Tableau Docker Postman Git Kubernetes Cloud Platforms (AWS)

Languages

English (C1) French (A2) German (A1) Hindi (C2) Punjabi (C2)

Hobbies & Interests

Bouldering Gardening Strategy Games Swarm Intelligence Ecology

Experience

Machine Learning Engineer

iNeuron | Remote

July 2024 - Mar 2024

Analytics & ML Solutions Developer

Working on an expenditure analysis project, utilizing Business Intelligence to optimize cost management and bulding a machine learning based anomaly detection system. Employing Scrum and Kanban methodologies to streamline project workflow and enhance efficiency.

Show More

Key Responsibilities

  • Engineered ETL pipelines using Python and PostgreSQL, improved financial record processing by 5%.
  • Constructed interactive Power BI dashboards to visualize KPIs impacting cost management.
  • Building a machine learning based anomaly detection system to monitor irregular patterns.
  • Conduct independent research to support findings and provide actionable recommendations.

Technologies in use

  • Python for data processing and analysis
  • Pandas, NumPy for data manipulation
  • Tableau, Power BI for interactive data visualization
  • Alteryx for data preparation and blending
  • SQL (PostgreSQL) for database management
Show Less

Python Developer, Security Research Intern

SAP Labs | Mougins, France

March 2023 - August 2023

Security in next-gen Analytics

Developed a client-server web interface, integrating distributed ledger solutions via REST APIs This intuitive interface provides a window to secure analytics and an efficient access to cross-company shared data, balancing transparency and traceability.

Show More

Front End Developer

Key Responsibilities

  • Enhanced scalability by containerizing the project with Docker and deploying on Kubernetes (Kyma) clusters.
  • Improved robustness through debugging existing modules.
  • Utilized ML-based synthetic data generation techniques to preserve privacy.

Achievements

  • Successfully developed and deployed a scalable and secure application.
  • Increased KPI anomaly detection accuracy by 5.42%.
  • Created and maintained comprehensive documentation for setup, development, and deployment processes.
Show Less

Software Engineer Intern

Diesel Loco Modernisation Works, Indian Railways | Patiala, India

June 2018 - August 2018

Networking in Transport Systems

Enhanced railway inter-network communication using Python socket programming. Implemented data serialization and compression techniques to minimize payload size, resulting in a 3% improvement in system responsiveness and increased data transmission efficiency.

Show More

Key Responsibilities

  • Engineered a custom application-layer class with Python sockets.
  • Collaborated with cross-functional teams to diagnose and resolve software development challenges.
  • Delivered presentations translating technical details into actionable insights for stakeholders.

Achievements

  • Successfully improved network communication efficiency.
  • Received commendations for innovative problem-solving and effective collaboration.
  • Implemented solutions that led to a 3% improvement in the overall system performance.
Show Less

Mathematics for Data Science, Trainee

Indian Institute of Space Science and Technology | Thiruvananthapuram, India

May 2017 - June 2017

Young Talent Nurture

Developed predictive models for equipment failure forecasting and analyzed large datasets using SQL and Tableau to identify trends and patterns. Applied advanced statistical techniques to optimize maintenance scheduling and engaged in advanced mathematics training to enhance analytical and problem-solving skills.

Show More

Key Responsibilities

  • Developed predictive models to forecast equipment failures.
  • Analyzed large datasets to identify trends and patterns using SQL and Tableau.
  • Engaged in advanced mathematics lectures to cultivate pathological questioning and problem-solving skills.

Achievements

  • Boosted equipment failure forecasting accuracy by 2-5%.
  • Successfully visualized and analyzed over 50,000 data points.
  • Enhanced problem-solving skills through advanced mathematics training.
Show Less

Research Projects

Oscillatory Neural Network for Voice Spoofing Detection

EURECOM

September 2022 - February 2023

View Project

Developed an innovative neural network model to detect synthetic speech and mitigate voice spoofing attacks.

  • Implemented coRNN model to effectively detect voice spoofing on ASV Spoof 2019 logical attacks database.
  • Optimised the architecture by introducing bidirectionality, boosting model performance by 1.9 percentage points, resulting in a best equal error rate of 6.8%.
  • Assessed the efficacy of the model in mitigating the EVGP through benchmark assessments.

Event Causality Detection using NLP

EURECOM

January 2022 - June 2022

View Project

Build a neural network integrating Bidirectional GRU for named-entity recognition, targeting cause and effect relations in text using Natural Language Processing techniques.

  • Attained an 85% accuracy rate on "SemEval-2010 Task 8" building a data-oriented neural network for event detection.
  • Analyzed, processed, and visualized over 10,000 event-related data points, integrating word embeddings for captured semantic relationships.
  • Engineered a scalable and maintainable codebase, adhering to software engineering best practices.

Education

Postgraduate
Industrial AI and Data Science

Centrale Méditerranée et l'Institut 3IA

Nice, France

2025 - 2026

Focus on industrial experience in artificial intelligence and data science engineering practices.

  • In total 6 months of industrial projects with multiple companies, followed by a 6 months internship.
  • Highlights:
    Artificial Intelligence, Large Language Models, Quantum Machine Learning, Data Mining, Deep Learning, Advanced Mathematics, Computer Vision, Ethical and Responsible AI, Big Data Analytics

Masters of Science
Data Science and Engineering

EURECOM

Biot, France

2021 - 2023

Specialized in advanced data science techniques and engineering practices.

  • GPA: 3.62/4
  • Conducted research projects in Voice Spoofing Detection and Event Causality Detection
  • Highlighted Courses:
    Cloud Computing, Databases, Machine Learning, Data Mining, Deep Learning, Bayesian Statistical Learning, Semantic Web, Image Processing, 3D and virtual imaging, Mobile application and services, Security and privacy for big data and cloud

Bachelors of Engineering
Computer Science and Engineering

Chandigarh University

Mohali, India

2016 - 2020

Built a strong foundation in computer science principles and software development.

  • GPA: 7.95/10
  • Participated in coding competitions and developed various software projects
  • Highlighted Courses:
    Operating System, Design and Analysis of Algorithms, Digital Communication and Computer Networks, Computer Organisation and Architecture, Big Data Analytics, Data Warehouse and Data Mining, Information Retrieval, Genetic Programming

Certificates and Awards

Cybersecurity Certificate

Cybersecurity

Issuer: Forage

Date: September 2024

Skills (In Progress): Access Control, Application Security Hygiene, Exploratory Data Analysis, Text-Based ML Models, Web Application Development, Email Security Fundamentals

Data Analytics Certificate

Accenture Data Analytics

Issuer: Forage

Date: September 2024

Skills: Data Analysis, Business Intelligence, Data Visualization, Problem-Solving, Client Communication

PwC Switzerland Power BI

PwC Switzerland Power BI

Issuer: Forage

Date: July 2024

Skills: Calculating measures, Defining KPIs, Insight and actions, Insights and actions, Power BI, Power BI dashboard, Self-reflection

Data Science Certificate

BCG Data Science

Issuer: Forage

Date: April 2024

Skills: Python, Business Understanding, Hypothesis Framing, EDA, Data Visualization, Mathematical Modelling, Model Evaluation, Client Communication, Model Interpretation

PowerBI

Power BI Essential Training

Issuer: LinkedIn Learning

Date: March 2024

Skills: Business Intelligence (BI), Data Analysis, Microsoft Power BI

Award

SAP Alumni Award

Issuer: SAP Labs

Date: October 2023

Skills: Decision Making, Growth Mindset, Influence, Innovation, Problem Solving, Project Management, SAP Product Knowledge, Time Management

Discrete Mathematics

Discrete Math and Analyzing Social Graphs

Issuer: HSE National Research University (Coursera)

Date: May 2021

Skills: Discrete Mathematics, Graph Theory, Social Network Analysis, Combinatorics, Mathematical Logic, Set Theory, Algorithm Analysis

Discover My Work

Welcome to my collection of notebooks and projects!
Explore topics like data analytics, machine learning, MLOps, deep learning, NLP, and more.
Each project is linked to its GitHub repository, so you can download and experiment with the code if you like.
I'm always open to suggestions, improvements, or even a friendly roast.
From data cleaning and visualization to deep-dive analysis, my work showcases a wide range of skills.
I stay up-to-date with the latest tech and methods through continuous online learning.
Feel free to explore my work and reach out!

AI Voice Assistant

LLM: AI Voice Assistant with RAG and Mistral

This project implements an AI-powered voice assistant for technical support, using speech recognition, natural language processing, and text-to-speech technologies.

MLOps Pipeline on AWS

Industry-Standard MLOps: A Practical Showcase

This project showcases an industry-standard MLOps pipeline for predicting student performance using a diverse dataset. It encompasses data ingestion, transformation, model training, evaluation, and deployment on AWS, utilizing services like EC2 and S3 while using modular coding and logging.

Resume Analyzer

AI, NLP & Streamlit: Gemini Resume Analyzer

If you are a recruiter reading about my project, my resume analyzer probably worked!! It's an AI-powered application that analyzes resumes against job descriptions, providing keyword analysis, resume evaluation, and tailored improvement suggestions using Google's Gemini AI model.

Power BI

Data Visualization: Power BI

To create a dashboard in Power BI for visualizing relevant KPIs and metrics in the dataset provided, including podcasts and articles, by PwC, to design intuitive visualizations, and implemented interactive features.

Autism Diagnosis

Data Exploration: Autism Diagnosis

Explored phenotypical and fMRI scans data to uncover insights into neurological patterns and their correlation with behavioral traits.

Data Cleaning in SQL

Data Cleaning in SQL

To practice as well as showcase my SQL proficiency, I transform raw housing data in SQL Server into a structured, analysis-ready format.

Elements

Text

This is bold and this is strong. This is italic and this is emphasized. This is superscript text and this is subscript text. This is underlined and this is code: for (;;) { ... }. Finally, this is a link.


Heading Level 2

Heading Level 3

Heading Level 4

Heading Level 5
Heading Level 6

Blockquote

Fringilla nisl. Donec accumsan interdum nisi, quis tincidunt felis sagittis eget tempus euismod. Vestibulum ante ipsum primis in faucibus vestibulum. Blandit adipiscing eu felis iaculis volutpat ac adipiscing accumsan faucibus. Vestibulum ante ipsum primis in faucibus lorem ipsum dolor sit amet nullam adipiscing eu felis.

Preformatted

i = 0;

while (!deck.isInOrder()) {
    print 'Iteration ' + i;
    deck.shuffle();
    i++;
}

print 'It took ' + i + ' iterations to sort the deck.';

Lists

Unordered

  • Dolor pulvinar etiam.
  • Sagittis adipiscing.
  • Felis enim feugiat.

Alternate

  • Dolor pulvinar etiam.
  • Sagittis adipiscing.
  • Felis enim feugiat.

Ordered

  1. Dolor pulvinar etiam.
  2. Etiam vel felis viverra.
  3. Felis enim feugiat.
  4. Dolor pulvinar etiam.
  5. Etiam vel felis lorem.
  6. Felis enim et feugiat.

Icons

Actions

Table

Default

Name Description Price
Item One Ante turpis integer aliquet porttitor. 29.99
Item Two Vis ac commodo adipiscing arcu aliquet. 19.99
Item Three Morbi faucibus arcu accumsan lorem. 29.99
Item Four Vitae integer tempus condimentum. 19.99
Item Five Ante turpis integer aliquet porttitor. 29.99
100.00

Alternate

Name Description Price
Item One Ante turpis integer aliquet porttitor. 29.99
Item Two Vis ac commodo adipiscing arcu aliquet. 19.99
Item Three Morbi faucibus arcu accumsan lorem. 29.99
Item Four Vitae integer tempus condimentum. 19.99
Item Five Ante turpis integer aliquet porttitor. 29.99
100.00

Buttons

  • Disabled
  • Disabled

Form