We Know How to Pass Notes. We Don't Know How to Think Together.
New paper from Beijing University of Technology, Zhejiang University, ETH Zürich, Meituan, and Vector Institute: “Silo-Bench: A Scalable Environment…
Shanghai AI Lab + ShanghaiTech. Tested on 6 model families including GPT-5 Nano, Gemini-2.5, DeepSeek-V3.2.
“Policy Compiler for Secure Agentic Systems” (UW-Madison, Langroid) builds something I have been thinking about since the Moltbook security…
A new paper from ETH Zurich just dropped: “Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?”…