- formatting
- images
- links
- math
- code
- blockquotes
- external-services
•
•
•
•
•
•
-
Domain-Specific Evaluations With Real Consequences
Benchmarks for finance, medicine, and cybersecurity that mirror the consequences that matter.
-
Teaching Language Models to Grow Up
BabyLM and psych-inspired batteries that treat LLMs as computational subjects of cognitive science.
-
Beyond the Unlearning Mirage
Dynamic probes and activation analysis that expose brittle unlearning, paired with watermarking and continual learning.
-
LLM Copilots for Peer Counselors
How motivational interviewing–aware sandboxes and analytics help peer counselors level up.
-
Google Gemini updates: Flash 1.5, Gemma 2 and Project Astra
We’re sharing updates across our Gemini family of models and a glimpse of Project Astra, our vision for the future of AI assistants.