Skip to content
View maxbaluev's full-sized avatar
🎯
Focusing
🎯
Focusing
  • 21:37 (UTC -03:00)

Block or report maxbaluev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
maxbaluev/README.md
Typing SVG

Most AI forgets. I build the kind that remembers what worked — and ships the code that proves it.

Email Telegram LinkedIn


🧠 Who I am

Founding / Staff AI engineer. I build self-improving AI systems on the hard primitives — late-interaction retrieval, agent memory, and reinforcement-style credit assignment — and ship the code that proves them.

Most AI forgets: a model scores 90% today and 90% tomorrow; what worked, what failed, what to avoid is generated and thrown away. I work on the opposite — systems whose judgment compounds from reality, getting sharper with use because outcomes are scored and bound to the next decision. That thesis is Accreted Intelligence.


🧭 Selected work — the primitives, in the open

Repo What it proves
accreted-intelligence The thesis + whitepaper: a Recursive Language Model over late-interaction scored-token memory, where the model is a replaceable processor and judgment lives in scored state.
maxsim-rs ColBERT-style late-interaction MaxSim, in clean zero-dependency Rust — the retrieval primitive, done right.
colpali-retrieve Multimodal late interaction — a text query retrieves over document-page images by patch-level MaxSim (ColPali-style).
scored-rerank Ranking that learns from outcomes — Beta-Bernoulli posteriors + Thompson sampling rank by what reality confirmed, not just base similarity.
mcp-retrieve An MCP server exposing late-interaction retrieval as an agent tool — retrieval where agents actually consume it.

🛠️ Stack

Rust Python PyTorch SQLite Linux

Core: late interaction (ColBERT · ColPali · MaxSim) · Recursive Language Models · RL credit assignment · agent memory · retrieval / RAG systems
Tools: Rust · Python · MCP · DSPy · PyTorch · SQLite · browser automation


💼 Background

acc4 / Accreted Intelligence (2025 – now)

Self-improving agent memory: late-interaction retrieval + outcome-credited posteriors, an owner-authority floor, and two reasoners over one substrate.

AMAICo-founder / CTO

Real-time emotional text-to-speech. Led the AI + systems from zero.

InitaAI engineering

Chat-based AI automating website / CRM / ads for SMBs (DSPy + MCP).

TeleportFounding Engineer

Peer-to-peer WebRTC CDN — distributed delivery at the edge.

🤝 Now

Exploring Staff / Principal / Founding AI Engineer roles — remote — in retrieval, agent memory, and RL-for-LLMs.

"Reasoning is getting cheaper. Judgment is not. I build for the part that compounds."

📫 [email protected] · Telegram

Pinned Loading

  1. accreted-intelligence accreted-intelligence Public

    Accreted Intelligence — whitepaper + architecture writeup for acc4, an RLM over late-interaction scored-token memory (engine source private)

  2. colpali-retrieve colpali-retrieve Public

    ColPali-style multimodal late-interaction retrieval (text → document-image patches via MaxSim) — numpy reference implementation

    Python

  3. maxsim-rs maxsim-rs Public

    ColBERT-style late-interaction MaxSim scoring for multi-vector retrieval — tiny, zero-dependency Rust crate

    Rust

  4. scored-rerank scored-rerank Public

    Outcome-scored reranking — rank results by what reality confirmed (Beta-Bernoulli posteriors + Thompson sampling), not just base similarity. Zero-dependency Rust.

    Rust

  5. awesome-multivector-retrieval awesome-multivector-retrieval Public

    Forked from TusKANNy/awesome-multivector-retrieval

    An extensive and commented list of resources on Late-Interaction Multivector Retrieval.

    TeX 1