Member-only story
š AI Society 5.23.25 ā HealthBench, Superclusters, and Compression Breakthroughs
š§ From neurons to agents
MKWriteshare draws parallels between brain architecture and AI agent models ā a must-read for visualizing where cognition and computation converge.
āļø OpenAIās new āHealthBenchā isnāt fully trusted ā yet
Dr. Rachel Draelos explains that HealthBench, a benchmark of 5,000 synthetic conversations, fails to fully reflect clinical safety. Single-turn prompts, rubric grading, and layperson scenarios leave gaps in real-world performance.
š Hyperscaler chessboard: Microsoft, OpenAI, and their global entanglements
- Microsoftās Aurora AI weather system now forecasts 10 days out and models pollution and renewables
- OpenAI announces a 1-gigawatt data supercluster in the UAE ($20B scale) ā the Emirates will also become the first nation with ChatGPT Plus for all citizens
š¾ LMCompress: The LLM-based compression leap
Dr. Ashish Bamania outlines how this new method doubles existing standards like JPEG-XL, FLAC, and H.264 ā and quadruples bz2 on text files. Compression is back.
š§Ŗ Agent development frameworks are maturing fast
Googleās new ADK expands support for multiple agent types, showing howā¦