// HELLO, I'M
Eli Benbaruk

I am an engineering student at ESILV, passionate about designing and building the next generation of AI-powered and decentralized applications. I enjoy exploring how artificial intelligence and Web3 can be combined to create products that are both technically robust and genuinely useful.

ROLE: Forward Deployed & AI Engineer
BASED IN: Paris, FR
STATUS: Open to work

Curious about what I build? Take a look at my work on GitHub ↗.

Want to connect? Find me on LinkedIn ↗.

Eli Benbaruk

Forward Deployed & AI Engineer

Engineering student @ESILV. I make companies AI-native and build autonomous multi-agent systems and on-chain protocols — software that acts, not just answers.

I've been building production software since I was 14. I've designed and deployed multi-agent systems, AI infrastructure and on-chain protocols. 4× hackathon winner.

Location: Paris, France

Status: Open to work — internship / apprenticeship (Paris)

Languages: French — native, English — C1, Italian — B2

Experience

Forward Deployed Engineer — Intern · TVA Engineering

Antony (92) · Jun — Jul 2026

Embedded in an industrial engineering firm (~110 people) to make it AI-native. I designed and deployed the whole central AI infrastructure end-to-end: cloud access (Claude API, reusable internal prompts & tools) and a self-hosted local cluster of networked Mac Minis serving open-source models (Qwen, Gemma), routed per task. I re-architected access to a non-AI-ready ERP (BoondManager) via custom MCP servers, built an in-house ERP, and shipped commercial tooling plus a 360° leadership dashboard. Result: dozens of hours/week saved and ~10 large clients won. Security, guardrails and tests throughout.

AI Engineer — Intern · Euretudes

Paris · Jun — Sep 2025

Helped the company adopt AI and become agentic. Built small custom agents per team (recruiting side: sourcing, CV screening, profile formatting, follow-ups, scheduling) and drove day-to-day adoption. Went from near-zero AI to agents used daily by 30+ recruiters; ~50 employees touched by the tooling. Trained non-technical colleagues and explained the tools to them.

Software Developer · Freelance

Remote · 2019 — now

35+ projects shipped end-to-end for SMBs (e-commerce, consulting, services), started at 14. React/TypeScript frontends, custom backends, full lifecycle: scoping → build → launch → support. Google Ads campaigns at ~2× ROAS on average. What it taught me: a product's real value lives in the messy parts of a business.

Education

Engineering degree — Computer Science · ESILV — École Sup. d'Ingénieurs Léonard de Vinci

Paris · 2024 — 2028

Computer science engineering degree. Fintech major (blockchain, crypto, finance, economics, software development, machine learning) and Startup track. Clubs: Kryptosphere (blockchain), Entrepreneurs Club.

Preparatory classes (CPGE) — MPSI/MP, CS option · Prépa Emc2 Hadamard

Paris · 2023 — 2024

Selective post-bac track preparing for the competitive entrance exams to France's top engineering schools. Maths, physics, engineering science, computer science. Ranked top 20 in France for MP.

Community & leadership

Member · DeVinci Blockchain (ex-Kryptosphere DeVinci)

Paris · 2024 — now

Member of the school's blockchain association. I help organize The Great Hack, on track to be Europe's largest student blockchain hackathon (partners: Sui, XRPL, RedStone, iExec, Hyperliquid, MegaETH, Arc, Circle, Hedera, Arbitrum).

Member · Entrepreneurs Club — ESILV

Paris · 2024 — now

Member of the school's entrepreneurship club: projects, events and mutual help around startup creation.

Projects (case studies)

Heureka (2026)

An AI co-scientist: 7 agents that generate, refute and rank hypotheses.

Problem: Research is bottlenecked by how many hypotheses a team can generate and seriously test. How do you surface testable ideas you wouldn't have had?

Approach: 7 specialised agents in an adversarial loop: generation (expert debate) → critique → falsification (red-team) → safety → Elo tournament ranking → evolution. Claim-level verification: each hypothesis is broken into atomic claims checked against open-access literature (OpenAlex, Europe PMC, arXiv); conjunctive scoring — a single contradicted premise collapses the score.

Impact: Ranked hypotheses with claim graphs, calibrated novelty scores, falsification verdicts and proposed experiments. Budget-driven orchestrator, per-role model routing, reproducible runs with provenance. Unit + integration tests, CI.

Role: Sole author — architecture, agents, evals.

Stack: Python, Pydantic, multi-agent
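
The conjunctive scoring and Elo ranking steps can be illustrated with a minimal sketch. This is not Heureka's actual code: the `Claim` type, the verdict labels, the product rule for combining claims and the K-factor of 32 are all assumptions made for the example.

```python
from dataclasses import dataclass
from math import prod
from typing import List, Tuple


@dataclass
class Claim:
    text: str
    verdict: str    # hypothetical labels: "supported", "unverified", "contradicted"
    support: float  # hypothetical confidence in [0, 1]


def conjunctive_score(claims: List[Claim]) -> float:
    """Score a hypothesis as the conjunction of its atomic claims."""
    if not claims:
        return 0.0  # nothing was verified, so nothing is credited
    if any(c.verdict == "contradicted" for c in claims):
        return 0.0  # a single contradicted premise collapses the score
    return prod(c.support for c in claims)


def elo_update(winner: float, loser: float, k: float = 32.0) -> Tuple[float, float]:
    """Standard Elo rating update after one pairwise comparison."""
    expected_win = 1.0 / (1.0 + 10 ** ((loser - winner) / 400.0))
    delta = k * (1.0 - expected_win)
    return winner + delta, loser - delta
```

Multiplying the per-claim supports is one simple way to make the score conjunctive: every extra weak premise lowers it, and a contradicted one zeroes it.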

FendForItself — 🏆 Agentic Economy on Arc 2026 (2026)

Autonomous AI-run companies spun up in ~30 seconds.

Problem: Launching a micro-business takes dozens of manual steps. What if agents could own and operate a business end-to-end?

Approach: Each generated company gets its own wallet, USDC funding and a priced API (x402-protected). Agents spawn, pay and coordinate on Arc.

Impact: Winner, Agentic Economy on Arc 2026 (Circle + Arc + Lablab.ai). Demo: 60 buyer agents settling on-chain over real-time SSE. $0.005/call at ~98% margin — viable because USDC is Arc's native gas.

Role: Solo build — agents, on-chain integration, frontend.

Stack: Next.js, Gemini, x402, Arc
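
The margin figure implies a ceiling on per-call cost, which the short calculation below works out. It assumes "margin" means gross margin, (price − cost) / price; the function name is hypothetical.

```python
from decimal import Decimal


def max_cost_per_call(price: Decimal, margin: Decimal) -> Decimal:
    """Largest per-call cost that still leaves the given gross margin."""
    # margin = (price - cost) / price  =>  cost = price * (1 - margin)
    return price * (Decimal(1) - margin)


price = Decimal("0.005")  # USD per call, from the demo figures
margin = Decimal("0.98")  # ~98% margin, from the demo figures
cost = max_cost_per_call(price, margin)
print(f"cost ceiling per call: ${cost}")
```

Under that definition, all per-call costs combined (inference, settlement, hosting) would have to stay around $0.0001.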

PACT (2026)

Identity & reputation for autonomous agents.

Problem: Agents that transact need to prove who they are and decide who to trust, with no human in the loop.

Approach: Three phases: cryptographic handshake (capabilities + Ed25519-signed bonds), discovery (need/capability matching), and a Web of Trust (referrals via bonded peers).

Impact: 20+ REST endpoints, network visualization dashboard, two demos (one fully LLM-driven, one scripted). Infra brick for trustworthy agent economies.

Role: Protocol design + backend + interface.

Stack: FastAPI, React, Ed25519
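
One way to picture the Web of Trust phase is as a search for a chain of bonded peers linking two agents. The sketch below is an assumption about how such a lookup could work, not PACT's actual protocol: the `bonds` mapping, the directed reading of a bond, the agent names and the hop limit are all hypothetical, and signature verification is left out.

```python
from collections import deque
from typing import Dict, List, Optional, Set


def referral_path(
    bonds: Dict[str, Set[str]], source: str, target: str, max_hops: int = 3
) -> Optional[List[str]]:
    """Return the shortest chain of bonded peers from source to target, or None."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        if path[-1] == target:
            return path
        if len(path) - 1 >= max_hops:
            continue  # do not extend chains beyond the hop limit
        for peer in sorted(bonds.get(path[-1], ())):
            if peer not in seen:
                seen.add(peer)
                queue.append(path + [peer])
    return None


# Hypothetical network: each agent maps to the peers it has bonded with.
bonds = {"alice": {"bob"}, "bob": {"carol"}, "carol": {"dave"}}
print(referral_path(bonds, "alice", "carol"))
```

Breadth-first search returns the shortest referral chain first, which is a natural default if trust is assumed to weaken with each hop.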

onchain-lie-detector (2026)

Automated crypto due diligence via fund-flow graphs.

Problem: Evaluating an on-chain protocol is slow, manual and easy to fake (inflated TVL, Sybils).

Approach: Ingests 4 heterogeneous sources (Alchemy RPC, Dune, Etherscan, DefiLlama), builds a fund-flow graph, runs Louvain clustering + Johnson's cycle detection to spot circular flows and Sybil networks.

Impact: 0–98 trust score across 5 weighted dimensions. 108+ tests. Replaces hundreds of hours of manual investigation per protocol.

Role: Design + implementation.

Stack: Python, graphs
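
The circular-flow check can be illustrated on a toy fund-flow graph. The sketch below uses a plain depth-first enumeration of simple cycles rather than Johnson's algorithm: it enumerates the same simple cycles, but without Johnson's efficiency guarantees. The wallet names and the adjacency-dict shape are hypothetical.

```python
from typing import Dict, List


def find_cycles(graph: Dict[str, List[str]]) -> List[List[str]]:
    """Enumerate simple cycles; each is reported once, rooted at its smallest node."""
    cycles: List[List[str]] = []

    def dfs(start: str, node: str, path: List[str], on_path: set) -> None:
        for nxt in graph.get(node, ()):
            if nxt == start:
                cycles.append(path[:])  # closed a loop back to the root
            elif nxt > start and nxt not in on_path:
                # Only visit nodes larger than the root so that every cycle
                # is found exactly once, from its smallest member.
                on_path.add(nxt)
                path.append(nxt)
                dfs(start, nxt, path, on_path)
                path.pop()
                on_path.remove(nxt)

    for start in sorted(graph):
        dfs(start, start, [start], {start})
    return cycles


# Hypothetical transfers: w1 -> w2 -> w3 -> w1 is a circular flow.
flows = {"w1": ["w2"], "w2": ["w3"], "w3": ["w1", "w4"], "w4": []}
print(find_cycles(flows))
```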

zkrust (2026)

A Groth16 zk-SNARK built from scratch, zero crypto deps.

Problem: Truly understand zero-knowledge proofs — not just call a library.

Approach: Full Groth16 over BN254: hand-rolled Montgomery field arithmetic (Fp/Fp2/Fp6/Fp12), optimal Ate pairing, Pippenger MSM, KZG commitments, radix-2 NTT. CLI setup → prove → verify.

Impact: ~6,500 lines of Rust, 159 tests, 6-crate workspace. ~1 ms to generate a proof, ~13.5 ms to verify (Apple Silicon).

Role: Sole author — maths, implementation, tests.

Stack: Rust
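
Montgomery arithmetic, the base layer of the field tower, can be sketched in a few lines. The version below is illustrative only: it uses Python's arbitrary-precision integers instead of the fixed-width limbs a Rust implementation would use, and all function names are hypothetical. `P` is the BN254 base field modulus.

```python
# BN254 base field modulus.
P = 21888242871839275222246405745257275088696311157297823662689037894645226208583
R_BITS = 256
R = 1 << R_BITS                   # Montgomery radix, coprime to the odd modulus P
P_NEG_INV = (-pow(P, -1, R)) % R  # -P^{-1} mod R (needs Python 3.8+)
R2 = (R * R) % P                  # R^2 mod P, used to enter Montgomery form


def redc(t: int) -> int:
    """Montgomery reduction: return t * R^{-1} mod P for 0 <= t < R * P."""
    m = ((t & (R - 1)) * P_NEG_INV) & (R - 1)
    u = (t + m * P) >> R_BITS     # exact division: t + m*P is a multiple of R
    return u - P if u >= P else u


def to_mont(a: int) -> int:
    """Map a to its Montgomery form a * R mod P."""
    return redc(a * R2)


def from_mont(a: int) -> int:
    """Map a Montgomery-form value back to the ordinary residue."""
    return redc(a)


def mont_mul(a: int, b: int) -> int:
    """Multiply two Montgomery-form values; the result stays in Montgomery form."""
    return redc(a * b)
```

The point of the representation is that `redc` replaces a division by `P` with shifts and masks by a power of two, which is what makes hand-rolled limb arithmetic fast.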

LiquidVest (2026)

Tradeable token vesting on Solana.

Problem: Vesting tokens are locked and illiquid until the cliff ends.

Approach: Each vesting schedule is a transferable Receipt NFT — selling the NFT sells the future token stream. Linear / cliff / step curves.

Impact: Anchor 0.30.1 program, Node.js + better-sqlite3 indexer, Next.js frontend. Property-based tests (Proptest) + 13 integration tests (Bankrun).

Role: On-chain program + indexer + frontend.

Stack: Anchor, Solana, Next.js
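
The three curve shapes can be sketched as pure functions of time. The definitions below are assumptions for illustration, not the on-chain program's logic: the function names are hypothetical, timestamps are plain integers, "cliff" is read as everything unlocking at once at the cliff, and integer floor division stands in for on-chain arithmetic.

```python
def linear_vested(total: int, start: int, end: int, now: int) -> int:
    """Tokens vest proportionally to elapsed time between start and end."""
    if now <= start:
        return 0
    if now >= end:
        return total
    return total * (now - start) // (end - start)


def cliff_vested(total: int, cliff: int, now: int) -> int:
    """Nothing vests before the cliff; everything vests at the cliff."""
    return total if now >= cliff else 0


def step_vested(total: int, start: int, end: int, steps: int, now: int) -> int:
    """Tokens vest in equal tranches at the end of each completed step."""
    if now <= start:
        return 0
    if now >= end:
        return total
    completed = (now - start) * steps // (end - start)
    return total * completed // steps


# Hypothetical schedule: 1000 tokens over 100 time units, sampled at t = 49.
print(linear_vested(1000, 0, 100, 49), cliff_vested(1000, 50, 49), step_vested(1000, 0, 100, 4, 49))
```

Because each curve is a pure function of time, whoever holds the receipt at a given moment can compute what is claimable without any per-holder state.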

HyperVest — 🏆 Hack'inDauphine 2026 — Tokenization track (2026)

On-chain equity vesting with Soulbound Tokens.

Problem: Equity vesting stays opaque, off-chain and hard to verify.

Approach: Vesting anchored on non-transferable Soulbound Tokens, with performance-based unlocks (MRR targets, GitHub commits, KPIs) instead of pure time.

Impact: Winner, Tokenization track at Hack'inDauphine 2026 (Kryptosphere). Solidity + OpenZeppelin v5, Foundry, deployed on HyperEVM Testnet.

Role: Smart contracts + frontend (with Ariles Hireche).

Stack: Solidity, HyperEVM
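
A performance-based unlock can be modelled as weighted milestones instead of a time curve. The sketch below is a hypothetical model in Python, not the deployed Solidity contracts: the milestone names, the basis-point weights and the function name are all invented for illustration.

```python
from typing import Dict, Set

BPS = 10_000  # basis points in 100%


def unlocked_amount(total: int, milestones: Dict[str, int], achieved: Set[str]) -> int:
    """Return the amount unlocked by the milestones achieved so far.

    `milestones` maps a milestone name to its weight in basis points.
    """
    if sum(milestones.values()) != BPS:
        raise ValueError("milestone weights must sum to 10,000 basis points")
    unlocked_bps = sum(w for name, w in milestones.items() if name in achieved)
    return total * unlocked_bps // BPS


# Hypothetical grant: half tied to revenue, the rest to delivery and a KPI.
milestones = {"mrr_target": 5_000, "github_commits": 3_000, "kpi": 2_000}
print(unlocked_amount(1_000_000, milestones, {"mrr_target", "kpi"}))
```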

truth-stamped (2025)

Cryptographic anti-deepfake timestamping.

Problem: Prove a piece of media existed at time T, in the age of AI deepfakes.

Approach: On-chain timestamping: SHA-256 + ECDSA signatures + immutable record on Base.

Impact: Live on Base Sepolia, 12 tests, ~$0.001 per stamp. Defensive content-authentication infrastructure.

Role: Design + implementation.

Stack: TypeScript, Solidity, Base
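
The hashing half of the scheme can be sketched with the standard library alone. This is an off-chain illustration under stated assumptions: the `Stamp` record and function names are hypothetical, the ECDSA signature and the on-chain write are omitted, and the timestamp here comes from the local clock, whereas an on-chain record would take it from the block.

```python
import hashlib
import time
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Stamp:
    digest_hex: str  # SHA-256 of the media bytes
    timestamp: int   # Unix seconds at which the digest was recorded


def stamp_media(data: bytes, now: Optional[int] = None) -> Stamp:
    """Hash the media and pair the digest with a timestamp."""
    digest = hashlib.sha256(data).hexdigest()
    return Stamp(digest, int(time.time()) if now is None else now)


def matches(data: bytes, stamp: Stamp) -> bool:
    """Check that the media is byte-for-byte what was stamped."""
    return hashlib.sha256(data).hexdigest() == stamp.digest_hex


stamp = stamp_media(b"abc", now=1_700_000_000)
print(stamp.digest_hex, matches(b"abc", stamp), matches(b"abd", stamp))
```

Only the 32-byte digest needs to be recorded, so the media itself never has to leave the owner's machine.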

Skills

Tools & tech: Next.js, React, FastAPI, Anchor, Foundry, viem / wagmi, Claude Code, Anthropic API, Gemini API, LangChain, MCP, Solana, EVM, zk-SNARKs, x402, Ed25519

Also: Postgres · SQLite · Linux · SQL · Pytest · Proptest / Bankrun · self-hosted LLMs (Qwen, Gemma)

Links