News
Reve AI's Reve 2. 0 Hits #2 on Image Leaderboard Using 10x Fewer GPUs
11+ hour, 14+ min ago (403+ words) Alpha Signal Reve AI's Reve 2. 0 Hits #2 on Image Leaderboard Using 10x Fewer GPUs Reve AI, a Palo Alto startup of roughly 50 people, just gatecrashed the frontier of image generation. Its new model, Reve 2. 0, debuted at #2 on the Artificial Analysis Text-to-Image Leaderboard…...
Together AI's Parallel Kernel Bench Reveals Top Models Fail 70% of Multi-GPU Tasks
1+ day, 1+ hour ago (318+ words) Alpha Signal Together AI's Parallel Kernel Bench Reveals Top Models Fail 70% of Multi-GPU Tasks Every major coding benchmark for LLMs tests single-GPU CUDA kernels. But production AI infrastructure doesn't run on one GPU, it runs on clusters, where the real…...
Anthropic's Claude Science Collapses 60+ Research Tools Into One Desktop App
1+ day, 7+ hour ago (230+ words) Anthropic has just launched Claude Science, a purpose-built AI workbench for scientific research. It is not a new model -- it is a new app that wraps Claude's existing models in a full research environment: one that can run analyses, manage…...
Google Rebuilds ADK Go 2. 0 Around Graph-Based Agent Workflows
1+ day, 5+ hour ago (334+ words) Alpha Signal Google Rebuilds ADK Go 2. 0 Around Graph-Based Agent Workflows Google just shipped ADK Go 2. 0, a major overhaul of its Agent Development Kit for Go. The headline feature is a brand-new graph-based workflow engine that lets you compose agents, tools,…...
Kilo Code Ships Mobile App So 3 M Developers Can Steer AI Agents Anywhere
1+ day, 18+ hour ago (247+ words) Alpha Signal Kilo Code Ships Mobile App So 3 M Developers Can Steer AI Agents Anywhere Kilo Code, the open-source agentic coding platform used by over 3 million developers, just shipped its mobile app on both Google Play and the App Store....
Microsoft's WSL Containers Kill Docker Desktop's Grip on Windows Dev
1+ day, 12+ hour ago (288+ words) Alpha Signal Microsoft's WSL Containers Kill Docker Desktop's Grip on Windows Dev Introduced at Microsoft Build 2026, WSL containers bring Linux container development directly into Windows through WSL, providing a built-in, enterprise-ready way to create, run, and manage Linux containers without…...
NVIDIA's Nemotron 3 Ultra Tops US Open-Weight AI on Brutal New Job Benchmark
5+ day, 6+ hour ago (183+ words) Most AI benchmarks test a model's ability to answer a question. AA-Briefcase tests whether a model can hold down a job. Artificial Analysis just launched the benchmark, and it is one of the most demanding evaluations of real-world agentic capability…...
NVIDIA Shrinks GLM-5. 2 Memory by 1. 8x With NVFP4 Without Losing Accuracy
5+ day, 18+ hour ago (232+ words) GLM-5. 2-NVFP4 is now ready to serve in v LLM. NVIDIA just dropped the official NVFP4 checkpoint of Z. ai's GLM-5. 2, the 744 B-parameter Mo E model built for long-horizon coding and agentic tasks, and it's already deployable with a single vllm serve command. The…...
Anthropic's Claude Mythos 5 Returns After US Government Lifts Infrastructure Ban
5+ day, 10+ hour ago (398+ words) Alpha Signal Anthropic's Claude Mythos 5 Returns After US Government Lifts Infrastructure Ban Two weeks ago, Anthropic's most powerful AI models were pulled from the internet by a government order. Today, the standoff is partially over. Anthropic announced that the US…...
Sakana AI's Coffee Bench Catches Claude Haiku 4. 5 Going Bankrupt Over 90 Days
6+ day, 9+ hour ago (296+ words) Alpha Signal Sakana AI's Coffee Bench Catches Claude Haiku 4. 5 Going Bankrupt Over 90 Days Most AI benchmarks are sprints. A model reads a prompt, generates an answer, and gets scored. Coffee Bench, a new benchmark from Sakana AI and KPMG Japan's…...