HN.zip

Show HN: Talos – Open-source WASM interpreter for Lean

At Cajal (YC W26) we’re excited to share Talos (https://github.com/cajal-technologies/talos), an open source framework for formal verification of WebAssembly modules in Lean.AI is now writing tons of the code that gets pushed to production. As code generation gets cheaper, verification becomes the bottleneck. We believe in a future where every piece of software comes with a mathematical proof that it does what its author intended - in doing so, eliminating many classes of exploits. Talos is part of the foundation for that.Talos provides a Wasm interpreter optimized for reasoning at the binary level, together with a weakest-precondition calculus layer for proving properties about programs. Because we reason directly about WebAssembly, any language with a Wasm backend is in scope: Rust, C++, Go, C, Swift, Kotlin, Zig, C#, and many more.To make this possible, we use Lean: a programming language and theorem prover that lets you both write software and mathematically prove that it's correct - all in one system. That's what lets Talos double as both an executable interpreter and the formal object Lean reasons about. Lean also integrates with modern AI proving tools, discharging goals automatically via both proof search and direct evaluation.To see Talos in action check out a proof for Stein's GCD algorithm, implemented in the popular Rust crate num-integer: https://github.com/cajal-technologies/talos/blob/main/progra....Our roadmap:- Full Wasm coverage by first passing the official W3C testsuite, then later verifying against SpecTec (formal Wasm spec) - Arbitrary crate verification - any Rust crate that compiles to Wasm should be in scope - Building our proof library codelib, to make verifying increasingly complex programs tractableWe would love to hear the community’s feedback on Talos and comments on the state of formal verification right now. Contributions are also welcome!

78 points by mfornet - 19 comments

19 Comments

BobbyTables2 [3 hidden]5 mins ago
Only thing left is to make a Kanban out of it…
himata4113 [3 hidden]5 mins ago
talos is already in use by https://github.com/siderolabs/talos, was confused for a second when I saw talos and wasm for a second, got excited about native wasm pod support.
jazzyjackson [3 hidden]5 mins ago
Also collides with the Power9 desktop system https://www.raptorcs.com/TALOSII/
jacobjwalters [3 hidden]5 mins ago
What is the program logic used here? The num_integer verification example seems to be hardcoding addresses in the spec; what if I want to reason about larger programs that dynamically allocate, where the addresses may not be known statically? How can I make sure these do not overlap? And since this is a shallow embedding into lean, what’s the approach for verifying properties of non-terminating programs?
mfornet [3 hidden]5 mins ago
> what if I want to reason about larger programs that dynamically allocate, where the addresses may not be known statically? How can I make sure these do not overlap?

We are actively working on this, as it is a pre-condition :P to reason about the simplest of useful programs. The idea is to develop an API around separation logic that allows you to reason about logic that manipulate non-overlapping regions of memory.

It won't be relevant if address are not known statically since API theorems will be parametrized over non-relevant constants such as addresses, function indices, etc...

> And since this is a shallow embedding into lean, what’s the approach for verifying properties of non-terminating programs?

To use the interpreter there is the concept of fuel, which we explicitly hide from the reasoning layer. Using fuel you can write statements of the form, this function returns out of fuel for any value of fuel passed to the interpreter, which is equivalent to prove that your program doesn't terminate.

quietusmuris [3 hidden]5 mins ago
Interesting. Do I have to write specs in Lean against the Wasm semantics or can you annotate Rust directly?
mfornet [3 hidden]5 mins ago
Both.

You can write "annotate" your rust code using asserts. On the wasm side asserts are converted to trap instructions, so the Lean spec will simply be: For every input this code never traps.

Part of our focus is making sure that specs are both easy to write and read, since they are human facing. Eventually you could imagine how writing code will mostly be writing specs, and both the code and the proofs will be handled by AI agents. In this scenario it is very important that humans can easily audit and modify the specs.

quietusmuris [3 hidden]5 mins ago
Doesn't that put the Rust compiler (and its assert lowering) in the trusted base? How do you know the asserts you wrote are the traps you're reasoning about?
IshKebab [3 hidden]5 mins ago
How do you actually prove it though? I understand if it's fully automated SMT-style proof, but doesn't Lean require tediously explicit proofs? If it doesn't prove automatically do you have to write out Lean helper proofs about the compiled WASM?
lukerj00 [3 hidden]5 mins ago
I’m on the Cajal team - not OP, but happy to answer questions.

The core bet is that Wasm is a good verification target (close to compiled artifacts, many languages target it), and Lean is the right place to do verification.

Super interested in hearing from people working with Lean, compilers or other Wasm verification frameworks (eg Iris-Wasm).

jsmorph [3 hidden]5 mins ago
Cool. I've been working on a compiler of a subset of Lean that targets WASM. The compiler is implemented in Lean.

https://github.com/jsmorph/leanexe

I think I managed to use Talos to prove the WAT generated from an example LeanExe program is correct. ?

https://gist.github.com/jsmorph/275a15dc21af037e1d02a1b433be...

Fun.

kdavis [3 hidden]5 mins ago
What other verification targets did you consider?
mfornet [3 hidden]5 mins ago
Initially we considered formalizing rust code, aeneas is a very promising project that would unlock a lot of features right way by transpiling to lean. However, we didn't want to lock ourselves to rust, so we decided to use a lower level target such that we could verify code from "any" language.

We considered LLVM-IR, and RISC-V.

Ultimately WASM felt like the right decisions. More importantly WASM spec is very well done in details, and it is written with formal verification in mind early on, and there are are plans from to include Lean as one of the targets for generating the spec automatically from SpecTec. Once this exist, we will formalize that our interpreter is correct under the definition generated from the official Wasm-Lean-Spec so we remove it from the Trusted-Base going forward.

lukerj00 [3 hidden]5 mins ago
More on this - LLVM-IR has no official formal semantics and it's riddled with UB. RISC-V has a formal model in Sail, but it's an ISA so you throw away the structured control flow and types which we want for proving.

Wasm has different levels we can validate against - starting with W3C test suite, then later full verification against the SpecTec-generated Lean semantics so that we can drop our interpreter from the trusted base.

oulipo2 [3 hidden]5 mins ago
Interesting, have you also looked at other formal methods, like Abstract Interpretation?
CurryFurry [3 hidden]5 mins ago
For "Lean"? LeaRn? Lean Manufacturing? Stupid one-word techbro product names.
johnsonjo [3 hidden]5 mins ago
Lean is a programming language [1]

> Lean is an open-source programming language and proof assistant that enables correct, maintainable, and formally verified code

[1]: https://lean-lang.org/

sohex [3 hidden]5 mins ago
Do people just not even search their proposed name anymore?
IshKebab [3 hidden]5 mins ago
It's ok for multiple different things to use the same name.

https://en.wikipedia.org/wiki/Lynx_(disambiguation)

I wish these comments were banned. They come up every time someone names a project with a name that was also used by one guy for his forgotten Lisp dialect in the 70s.