LeanLang for Verified Autonomy Hackathon

Hackathon Winners

LeanLang for Verified Autonomy

Winner presentations

1st Prize

Who sacked my cluster?

Judges say

This is the highest-graded project. The judges noted it has a "realistic model of k8s" and proves "sensible theorems," making it the clear winner.

Winner says

I worked on making a formally verified model of Kubernetes in Lean, that was used as a judge during reinforcement learning to train a model that can be used as an SRE agent.

This was my first time working with Lean (or any theorem proving language) and it seemed to fill most of the gaps that I saw as potential issues with the rise of AI agents.

Access the code

Git Repo >

2nd Prize

Sambar & Soundness

Judges say

Tied with Kleisli Triple on both Grade and Expertise, it takes 2nd prize because it achieved "end-to-end verification" and a "working LLM → jq → SQL pipeline," whereas Kleisli Triple was noted as needing "further furnishing" and having potential scalability issues.

Winner says

Vimala S.

The project gave me the opportunity to apply formal verification in a practical setting while learning Lean.

Durwasa

I'm deeply grateful to Dr. Siddhartha Gadgil for the mentorship and resources that made learning Lean4 possible for us.

Access the code

Git Repo >

Honorable Mentions

Five projects that stood out

Kleisli Triple

A toy model of a version control system (VCS) with verified behavior to flag merge conflicts and certify safe actions for collaborating agents.

Naveen Maurya Bhoris Dhanjal Mohammed Farhaan

Access the code

Git Repo >

Judges say

Despite some concerns about scalability, it is technically very strong and was one of the highest-rated projects overall.

FlowGuard

A formal verification framework that proves a foundational safety property: individual agent safety does not guarantee compositional safety. It detects emergent capabilities in multi-agent systems before deployment.

Dev Soni

Access the code

Git Repo >

Judges say

Highly praised by the judges for its theoretical approach to multi-agent systems and for mechanizing "impossibility results."

MARS

A bridge between LLMs and Lean4 that rigorously verifies the legality and soundness of chess mate-in-N puzzles.

Aastha Gupta Saachi Kaup Malhar Patel Rajlakshmi Chavan

Access the code

Git Repo >

Judges say

The highest expertise score among the Grade 2 projects, providing a "nice demonstration of what's possible with Lean + verification."

Caplean

A capability envelope for agentic coding pipelines, ensuring agents only invoke operations within their declared scope.

Chaitanya Garg

Access the code

Git Repo >

Judges say

A very comprehensive submission that delivered all three core objectives, including a bonus Python middleware layer and a Cursor/MCP integration.

Totientfunction

A Lean 4 library that turns redaction policies into proof-carrying artefacts to ensure safe data handling in multi-agent pipelines.

Sagnik Saha

Access the code

Git Repo >

Judges say

The judges specifically highlighted that the "framing, implementation, and theorems are nice," making it stand out among other Grade 2 projects.

Co-Participant Codes

Git: Shanthini Kamaraj> Git: Aman> Git: Abhiram R> Git: Devidas> Git: Shubham Kumar> Git: Nilesh Trivedi> Git: Vignesh Karri> Git: Priyashree>

What We'll Prove

Making Verification Foundational to Autonomy

Lean as Foundation

Use Lean as the grounding layer for AI agents—proving that verification can be embedded at every level of autonomous decision-making.

Invariants First

Formalize system invariants before code generation, proving that transformations preserve semantics even as systems evolve.

Guaranteed Correctness

Attach machine-checkable guarantees to autonomous workflows and agent-generated code—moving from hope to proof.

This hackathon isn't about theoretical exercises. You'll build working systems that demonstrate how verification makes autonomy trustworthy.

Introductions - LeanLang

A taste of LeanLang

Run Code Snippet

A taste of Lean: eval and syntax

Run Code Snippet

Additional Learning on Lean

Lean FRO Lean Communities Learning LeanLangur

Take A Deep Dive - Metaprogramming in Lean

Negating Expressions II: Handling implications

Negating Expressions III: Negating Exists

Challenges

What You'll Work On

You'll work hands-on in teams, guided by experts, tackling real-world challenges at the intersection of formal verification and autonomous AI. The theme for this hackathon is:

Proved Guardrails in Lean for Multi-Agent Systems

Multi-agent systems such as those built on OpenClaw have become extremely popular because of their power across many domains. However, they also introduce significant risks. These risks must be addressed without sacrificing the capabilities that make such systems valuable. Guardrails backed by formal proofs are a natural way to ensure safety without being excessively restrictive. This hackathon is focused on building such verification systems in the Lean Prover.

The hackathon will begin with a one-day workshop on Lean, focused mainly on programming and the basic metaprogramming needed to build these systems. This will be followed by one day of in-person guided project work, and then 10 days of online interaction to complete the projects.

Participants will develop a model of a class of multi-agent systems like OpenClaw in Lean, together with the basic theory needed to reason about them. They will define types, terms, and functions that capture the semantics of these systems in Lean, and prove the core theorems on which proof automation can build. To interface with the Lean code, they may also extend Lean's syntax and build connections to proof automation tools. Participants may choose the style of agentic system they want to formalize, including specialized systems such as those for mathematics.

A successful project will be able to verify correct behaviour and flag risks within a powerful agentic system, without being so defensive that it weakens the system's usefulness. All code will be open-sourced (with acknowledgement of the hackathon sponsor) and should be well documented. After the hackathon, we aim to aggregate the various contributions into a common system.

Schedule

Three-Phase Hackathon

March 17, 2026

Applications for Registration Open

March 27, 2026

Applications for Registration Close

April 1, 2026

Confirmation of Participation Intimated

April 17 (Fri), 18 (Sat), 2026

Hackathon

Rama Rao Auditorium, Indian Institute of Science (IISc), Bangalore

Introductory talks on Lean & verified autonomy
Problem framing sessions
Team formation
Hands-on guided modeling sessions
Food and coffee will be provided for in-person attendees

April 19 – May 1, 2026

Online Collaboration Phase

Remote / Online (Tulip / Discord channels TBA)

Structured periodic check-ins
Open discussions and peer collaboration
Mentor support from experts
Iterative refinement of formal models

May 1, 2026

Submission of the Projects

May 5, 2026

Closing Session and Announcement of Results

Primarily Online (Bangalore participants encouraged to attend in person)

Final demos and presentations
Spec + refinement walkthroughs
Lessons learned discussions
Winner announcements
Future collaboration pathways

Hackathon: LeanLang for Verified Autonomy

The Paradigm Shift in AI Development