Data & ML Consulting

Your data has answers.
I help you find them.

End-to-end data science, machine learning, and AI integration for businesses that want real results — not just pretty dashboards.

Services

From prototype to production — I cover the full data stack.

Machine Learning Development

Custom ML models built for your specific problem — predictive analytics, classification, regression, anomaly detection — then integrated cleanly into your product or workflow.

Analytics & Business Intelligence

Turning raw data into dashboards and reports your team actually uses. Clear, actionable visualizations built with the right tooling for your stack — no unnecessary complexity.

AI & LLM Integration

Connecting large language models and AI agents to your existing systems — automating workflows, building intelligent assistants, or extracting structured insight from unstructured data.

Data Pipelines & Engineering

Designing and building reliable data infrastructure — collection, transformation, storage, and delivery — so the right data gets to the right place, every time.

Data science with engineering depth.

I'm Jumar Vermeulen — a data scientist and ML engineer based in South Africa. I spent nearly two years as the sole data scientist at a fintech company, owning the entire data function: pipelines, ML models, BI dashboards, AI integrations, and backend software.

My background is a little unusual: I started with a BSc in Physics at the University of the Free State, then completed advanced engineering coursework at Wits. That mix of quantitative rigour and engineering pragmatism shapes how I approach data problems — I care about what works in production, not just what looks good in a notebook.

I work across the full stack. If the problem involves data, I can help.

2+ years of production ML experience
1 full data function, owned end-to-end
4 service areas covered

  • University of the Free State: BSc, Physics & Engineering (2021)
  • University of the Witwatersrand: 126 NQF credits, Mechanical Engineering, NQF level 7 (2023)
  • Harvard University: CS50x, Introduction to Computer Science (2024)
  • Pierian Training: Python for Data Science & Machine Learning (2024)
  • LangChain Academy: Introduction to LangGraph (2025)

Tools & Technologies

Production-grade tools, not just tutorial experience.

Languages & Data

Python, SQL, PostgreSQL, Pandas, NumPy

ML & AI

scikit-learn, LangGraph, LLM Integration

Visualisation & BI

Plotly, Matplotlib, Seaborn, Streamlit, Metabase, Grafana, PostHog

Infrastructure & Backend

Docker, Django, AWS S3, AWS DynamoDB, GitHub

Got a data problem?

Whether you have a defined project or just a vague sense that your data could be doing more — reach out. I'll give you a straight answer on whether I can help and what it would look like.