Llm | Latent Variable

We Know How to Pass Notes. We Don't Know How to Think Together.

New paper from Beijing University of Technology, Zhejiang University, ETH Zürich, Meituan, and Vector Institute: “Silo-Bench: A Scalable Environment…

Shanghai AI Lab + ShanghaiTech. Tested on 6 model families including GPT-5 Nano, Gemini-2.5, DeepSeek-V3.2.

“Policy Compiler for Secure Agentic Systems” (UW-Madison, Langroid) builds something I have been thinking about since the Moltbook security…

A new paper from ETH Zurich just dropped: “Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?”…