benoit.barthelet@gmail.com • +33 698 609 237 • 48 years old
13, rue d’Alexandrie - 75002 Paris, France
| AI/LLM Engineering & MLOps | Cloud Infrastructure (AWS, GCP) | Capital Market Products & Processes |
| API design & Backend Architecture | Data Pipelines & Migrations | Team Enablement & Coaching |
| Python / asyncio (expert) | PostgreSQL (advanced) | Open-Source Development & Maintainership |
deepseek-acp-adapter — github.com/euri10/deepseek-acp-adapter Open-source protocol adapter bridging DeepSeek reasoning models (R1, V3) to any ACP-capable editor (Neovim, VS Code, Zed). Built to break free from closed, vendor-tied coding assistants. Implements the Agent Communication Protocol for real-time streaming, tool-use, MCP servers and multi-turn conversations.
RAG-powered Financial Intelligence Assistant (SFJ Technologies) End-to-end retrieval-augmented generation system for natural-language querying of proprietary financial datasets. Architected the full pipeline: embedding generation, chunking strategies, vector search (pgvector), hybrid retrieval logic, and LLM orchestration via LiteLLM with model fallback and cost-aware routing across OpenAI, Anthropic, Perplexity.
Local LLM Infrastructure Designed and operate a home-lab inference stack running open-weight models (Llama 3, Qwen, DeepSeek, Mistral) on consumer GPU hardware using vLLM, Ollama, and llama.cpp. Uses this as a proving ground for model evaluation, prompt engineering, and agentic workflow patterns before bringing them into production contexts.
uvicorn — uvicorn.org Former maintainer of the lightning-fast ASGI server built on uvloop and httptools. Powers a significant fraction of Python web services in production today.
litestar — github.com/litestar-org/litestar Contributor to the high-performance ASGI framework. Deep familiarity with the modern Python async stack (msgspec, uvloop, httptools).
SFJ Technologies LLC − Chief Architect and Head of Engineering − Sep 2018 ‣ Present
Fintech generating signals to measure and detect bankruptcy and fraud risks. Scaled the technology environment by deploying enterprise-strength operations and a microservice architecture, all within a regulated data-governance framework.
AI & LLM Engineering
Platform Engineering
Sullivan Cloud − Chief Solution Architect − Sep 2017 ‣ Present
Leading advisory and system integrator for digital transformation of legacy workflows. Designed and developed cloud migration tooling on GCP.
Generali Investments Europe − Portfolio Manager − Dec 2010 ‣ Sep 2017
Actively managed a multibillion portfolio of derivative-based assets and drove the technology transformation of the front-office toolchain.
Lazard Freres Gestion − Portfolio Manager − Jun 2005 ‣ Dec 2010
Created the structured-product and Fund-of-Hedge-Funds desk for ultra-high-net-worth clients.
SGAM Alternative Investments − Fund derivatives trader − Mar 2001 ‣ Jun 2005
Pioneered a new market in hedge-fund derivatives, leading to the first billion-euro capital-guaranteed mutual fund in Europe.
Ecole Supérieure de Commerce de Paris — MSc Finance, Derivatives Trading Thesis: pricing of PCS options (Property-Claim Services) and impact on reinsurance natural-disaster markets
OSCP — Offensive Security Certified Professional (OS-101-035377)
GitLab Certified Associate
Associate Google Cloud Engineer
| LLM & AI | RAG pipeline architecture, embedding strategies & vector search (pgvector), LLM orchestration (LiteLLM), multi-model routing & fallback, prompt engineering, agentic workflows, LLM evaluation & guardrails, ACP (Agent Communication Protocol) |
| AI tooling | Neovim + CodeCompanion, custom DeepSeek ACP adapter, self-hosted open-weight models (vLLM, Ollama, llama.cpp) on local GPU, model evaluation harnesses |
| Languages | Python (expert, deep asyncio), JavaScript / React, C# (Excel-DNA), Rust (learning) |
| Infrastructure | AWS, GCP, Terraform, Docker/Podman, GitLab, Debian daily driver |
| Databases | PostgreSQL (advanced), async migrations, pgvector |
| Security | OSCP-certified; threat modelling for web applications and APIs; AI safety & prompt injection awareness |
| Languages (spoken) | French (native), English (fluent), Chinese (beginner) |
Debugging is twice as hard as writing the code in the first place.
So if you’re as clever as you can be when you write it,
how will you ever debug it? – Brian Kernighan
Inspired by pandoc_resume