Ramon Perez

Data Scientist & Engineer

hello-ramon@pm.me ramonpzg Sydney, AU

Hi there! I'm an engineer and educator specializing in the design and deployment of machine learning systems. I have over 7 years of experience, and have worked on everything from distributed data processing with Dask and PySpark to building and scaling LLM-powered applications from clusters to single devices that run inside robots. My engineering philosophy is ship it, sweat the details and teach it.

At my core, I'm an educator. I've had the privilege of teaching data science and AI at institutions like Johns Hopkins University, London Business School, and Coder Academy, and for organisations like Mastercard, Capital One, Chanel, Bupa, HSBC and more. This experience has shaped how I build products, making me focus on understanding where users get stuck and creating tools and documentation that remove friction. Currently, I teach MLOps at Decoded and co-run a technology consultancy based in the Dominican Republic, helping businesses navigate the complexities of AI adoption.

Experience

Engineering

Research Engineer at Menlo Research

Remote

Jan to Oct-25

Built performance benchmarking and visualization tools for local LLM deployment in robotics contexts. Created robobench (Python/C++ library for measuring real-world LLM performance) and roboviz (real-time visualization of token flow through CPU architecture using Python/Vue.js). Presented research on C++ LLM integration at CppNow 2025. Rebuilt Jan.ai's documentation architecture, created MCP integration tutorials, and conducted user research to shape product strategy. Currently working on simulation platform and data engine infrastructure for the Asimov robotics team.

DX Engineer at Seldon

London

Jun-23 to Oct-24

Actively contributed to Seldon's suite of dev tools with emphasis on the paid machine learning inference products, crafted educational content for the OSS tools, delivered talks at conferences on different ML topics, and engage with our developer community while coordinating the MLOps World Meetup group events, fostering stronger connections and support within our network.

Sr. Product Developer | Instructor at Decoded

Remote

Nov-21 to Present

Created Decoded's initial data engineering platform on AWS, and developed four machine learning (ML) applications for language translation, churn prediction, fraud detection, and content recommendation. I led the creation of our MLOps engineering program (see it here) and contributed to our React-based presentation tool. I still teach for Decoded's biggest clients.

Teaching

London Business School

Programming Instructor

Remote | Jan to Mar-21

Coder Academy

Data Scientist & Educator

Sydney, AU | Sep-19 to Aug-21

Johns Hopkins University

Lead Quantitative Tutor

Remote | Jan to Aug-19

Research

Research Engineer at INSEAD

Remote

Oct-19 to April-21

During my tenure at INSEAD, I worked with a team of outstanding researchers to develop algorithms for analyzing company structures and culture. Our main project focused on generating hierarchical organizational structures (vertical or horizontal) by analyzing historical job advertisements from target companies and similar organizations. We leveraged various natural language processing techniques alongside supervised, semi-supervised, and unsupervised machine learning methods. Additionally, I developed a Python package implementing Latent Dirichlet Allocation to automatically extract and select meaningful topics from company reviews on platforms like Glassdoor. This package not only identified key themes but also provided quantitative cultural measures for organizations based on the discovered topics, offering valuable insights into organizational culture and structure.

Research Associate at Global Financial Literacy Excellence Center

Washington, DC, USA

Sep-18 to April-19

At GFLEC, I led multiple research initiatives focused on retirement and financial education. I developed and deployed a comprehensive MTurk survey examining millennials' financial education seeking behaviors, while simultaneously conducting extensive literature reviews and expert interviews with 20 policymakers and retirement specialists to understand phased retirement programs. Using the Health and Retirement Study dataset, I built statistical models in Stata and R to analyze patterns between future workforce participation and baby boomers' gradual retirement trends, providing valuable insights into retirement planning and financial literacy.

Research Intern at Service Management Group

Kansas City, MO, USA

Oct-16 to Sep-17

Created ArcGIS maps to visualize consumer behavior data, while concurrently conducting sentiment analysis using SPSS and Python, and orchestrating A/B tests to assess the efficacy of various data collection strategies like email, mail, and point-of-sale.

Technical Skills

Languages

Native Speaker: Python, English, JavaScript and Spanish

Intermediate: French and Rust

Technologies

PyTorchDockerFastAPISolidJSNumPypandasAstroGitTauri