David Mike-Ewewie

AI Infrastructure Engineer

Building production ML systems that operate at scale. From distributed training on HPC clusters to real-time inference with sub-10ms latency.

8 AWS & CompTIA Certs
4 Publications
130+ Team Managed
<10ms P99 Latency
Senior ML Infrastructure

Technical Arsenal

97 verified skills across 8 certifications

ML/AI Systems

Machine Learning (ML) Artificial Intelligence (AI) Distributed Training MLOps Model Serving PyTorch TensorFlow Computer Vision NLP Feature Stores

Cloud & Infrastructure

AWS (3 certs) Cloud Computing Cloud Infrastructure Cloud Architecture Infrastructure as Code Virtualization Kubernetes Docker Terraform HPC/SLURM GPU Optimization

Security & Networks

Network Security Cybersecurity Network Management Cryptography Access Control Threat Detection Vulnerability Management Firewalls VPN

Engineering

Python C++ Java SQL DevOps CI/CD Go REST APIs gRPC Apache Kafka Redis

Data Engineering

Data Engineering Data Pipelines ETL Data Warehousing Databases Data Modeling Apache Spark Airflow dbt Snowflake

Systems & Operations

Linux System Administration Operating Systems Troubleshooting Disaster Recovery Backup & Recovery System Configuration Monitoring Ansible

About Me

From Research to Production: Building AI Systems That Scale

I'm an AI Infrastructure Engineer with a rare combination: published research expertise paired with production engineering experience. Currently pursuing my MS in Computer Science at UT Permian Basin while building scalable ML systems that serve millions of predictions daily.

My journey spans from architecting distributed infrastructure for 130+ employees across continents to optimizing ML pipelines that reduced latency by 73% for algorithmic trading. I don't just theorize about systems—I build them, deploy them, and keep them running at 99.9% uptime.

What drives me? The challenge of making AI accessible and reliable at scale. Whether it's implementing distributed training on HPC clusters or building privacy-preserving analytics for libraries, I focus on systems that deliver real value in production.

Featured Projects

Production systems and research that push boundaries

Arctic Sea Ice Classification

Research

Novel multimodal transformer achieving 98.4% accuracy on NSIDC dataset. Fusing optical, SAR, and meteorological data for climate monitoring.

98.4% Accuracy 1TB+ Dataset
PyTorch Vision Transformers HPC Distributed Training
Explore Project

Real-time Trading ML Platform

Production

Built ML infrastructure processing 1M+ predictions daily with sub-10ms latency. Reduced compute costs by 40% while improving throughput 3x.

<10ms P99 1M+ Daily
Kubernetes C++ Redis Feature Store
Explore Project

Enterprise Distributed Systems

Leadership

Architected infrastructure supporting 130+ employees across continents. Achieved 99.9% uptime while reducing operational costs by $200K annually.

130+ Users 99.9% Uptime
AWS Terraform Ansible Monitoring
Explore Project

Adaptive Learning Platform

IEEE FIE 2025

ML system personalizing course sequences for 32K+ students. Reduced poor recommendations by 66% using contextual bandits.

66% Better 32K+ Students
Contextual Bandits FastAPI PostgreSQL Docker
Explore Project

LibraryIQ

Innovation

Privacy-preserving analytics platform for 400K+ volumes. Processing 10M+ API calls with intelligent batch optimization.

400K+ Items 100% Privacy
Computer Vision YOLOv8 Thermal Imaging FastAPI
Explore Project

View All Projects

Explore my complete portfolio including open-source contributions, research projects, and production systems.

Browse Full Portfolio

Let's Connect

I'm building the next generation of AI infrastructure

I'm passionate about solving complex ML infrastructure challenges. Whether you're scaling to millions of users or optimizing for microsecond latency, let's discuss how we can push the boundaries together.

Email Me LinkedIn GitHub