GPT-5.4 vs Gemini 3.1 Ultra vs Claude Opus 4.6: Picking the Right Frontier Model
A benchmark-driven comparison of GPT-5.4, Gemini 3.1 Ultra, and Claude Opus 4.6 with practical recommendations by use case
Co-Founder & CTO @ Ailog
I ship AI systems: RAG, agents, fine-tuning. Finishing an MSc in Machine Learning at KTH Stockholm alongside an engineering degree at ENSIMAG Grenoble.
Shipping code, not slides. Papers only when they exist.
I'm Co-Founder & CTO at Ailog, where we build AI features for small and mid-sized teams: retrieval over messy internal documents, agent workflows that replace brittle scripts, automation around the tools clients already pay for. Before Ailog I researched privacy-preserving NLP at INRIA Grenoble, and I'm finishing an MSc in Machine Learning at KTH Stockholm alongside an engineering degree at ENSIMAG Grenoble.
A mix of startup work, research, and older engineering projects.
Co-founded Ailog in March 2025 to build AI features for small and mid-sized teams. We integrate LLMs into existing stacks: RAG over internal docs, agent workflows, automation around tools clients already pay for.
Research at INRIA comparing sensitive information leaks in NLP models across French and English. Analyzed how language structures impact memorization risks in LLMs.
ESP32-based system controlling HVAC, 3D printers, and cameras via Home-Assistant. Arduino programming, Docker containers, and 3D modeling for custom enclosures.
Full-stack platform enabling businesses to deploy custom RAG solutions with document management, multi-language support, and analytics.
Java simulation of autonomous firefighting robots with A* pathfinding, event-driven simulation, and graphical interface for terrain visualization and robot coordination.
Content-aware image resizing using dynamic programming and GPU acceleration with Numba. Achieved 10-100x speedup through vectorized GPU operations.
Designed a RISC-V processor in VHDL with custom extensions, interrupt handling, and peripheral integration (LEDs, HDMI). Validated on FPGA hardware.
Implemented a JPEG encoder from scratch: color space conversion (RGB to YCbCr), DCT, quantization, and Huffman entropy coding for image compression.
Notes from production: RAG, agents, privacy-preserving NLP.
A benchmark-driven comparison of GPT-5.4, Gemini 3.1 Ultra, and Claude Opus 4.6 with practical recommendations by use case
Google's TurboQuant compresses KV cache to 3 bits with zero accuracy loss using rotation-based quantization, enabling 6x memory savings for LLM inference
Proven architectural patterns for deploying multi-agent AI systems that survive contact with real workloads and real users
How combining knowledge graphs with vector retrieval achieves up to 99% search precision and when you should adopt GraphRAG
How to adapt RAG techniques for code search with syntax-aware chunking, code embeddings, and AST-based retrieval
How to set up, run, and integrate local language models using Ollama for development, testing, and privacy-sensitive workloads
Research on privacy-preserving NLP, done at INRIA Grenoble.
arXiv • INRIA Grenoble
Two training schemes against memorization in fine-tuned language models: a masked objective for BERT-style models, a causal one for GPT-style. Both target direct and indirect identifiers. Evaluated on a medical dataset against several baselines.
View PaperExperience and education.
Mar 2025 – Present
Co-Founder & CTO
Ailog • Hybrid
Mar 2025 – Present
Technical Consultant – RAG
Nov 2023 – Mar 2025
Technical Director
Nsigma Junior-Enterprise • Grenoble
Jun – Sep 2024
Research Intern
INRIA • Privacy-Preserving NLP
May – Jun 2023
IT Technician Intern
IE-Concept • IoT & Embedded
2025 – 2026
MSc Machine Learning
KTH Royal Institute of Technology • Stockholm
2023 – 2026
Engineering Degree
ENSIMAG • Grenoble INP
2021 – 2023
Preparatory Cycle
CPP • Valence
AI engineering, research, or client work at Ailog.
Inquiries, collaborations, or just hello
Prefer dev.to, Scholar, or another channel? See all ways to reach me