Contractor at OpenAI
Contract work connected to frontier AI systems, adding production-facing model experience to my research background.
Machine learning researcher-engineer
EPFL MSc Data Science, OpenAI contractor, ETH research work, publications at NeurIPS/TMLR, and ML systems experience at Oracle Labs.
I like work where careful modeling, experiments, and engineering all matter: understanding model behavior, building robust learning systems, and turning research ideas into working artifacts.
Background
I move between academic ML research and practical systems work: understanding how models behave, then building tools and experiments that make that understanding useful.
Contract work connected to frontier AI systems, adding production-facing model experience to my research background.
My MSc at EPFL gave me a broad ML foundation: NLP, diffusion models, RL, probability, and data visualization. GPA 5.83/6; Teaching Assistant for Modern NLP.
At ETH, I work on adaptive learning systems, especially test-time meta reinforcement learning for policies that adapt during deployment.
My publications span diffusion interpretability, sparse autoencoders, probabilistic circuits, and tensor factorization theory.
At Oracle Labs, I worked on ML systems for cloud-security defense inside Spark-scale pipelines processing billions of events.
Questions
I am interested in machine learning problems where empirical behavior, theory, and engineering constraints all matter.
How can we understand and control what neural networks compute?
How can systems learn to learn?
How can reinforcement learning become more sample-efficient and reliable?
Applies sparse autoencoders to analyze text-to-image diffusion models, contributing to interpretability methods for modern generative systems.
Establishes a rigorous connection between tensor factorizations and probabilistic circuits, unifying model families and exposing new architecture-search opportunities.
Unifies overparameterized probabilistic circuit architectures and studies low-rank decompositions as a way to understand and compress expressive layers.
Supervised student research on characterizing and mitigating subliminal learning behavior in large language models.
Supervised student research investigating sink-register behavior in diffusion transformers.
Experience
Research Assistant
Developed machine learning systems for cloud security defense in a large-scale Spark pipeline.
MSc Thesis
Investigating test-time meta reinforcement learning for policies that adapt during deployment.
Research Mentor
Mentoring students on mechanistic interpretability research for LLMs and diffusion transformers.
MSc Data Science
Graduate training in machine learning, NLP, diffusion models, reinforcement learning, and probability.
Writing
Wrapped up my final semester of courses and launched this website as a research-engineering profile hub.
Our work on tensor factorizations and probabilistic circuits was accepted to TMLR and received Featured Certification.
Contact
Best way to reach me: antonio.mari02 [at] outlook [dot] com