Large Language Models

Topic · 8.7K followers · 19K stories

Recommended stories

In Data Science Collective by Rachel Draelos, MD, PhD
HealthBench Does Not Evaluate Patient Safety
HealthBench is a recently released benchmark to evaluate large language models in healthcare. This blog post summarizes what HealthBench…
5d ago

Joshua Anang
The math and logic behind ChatGPT. This paper is all you need.
I’m Joshua Anang and I was 17 years old (last year) when I built my own version of ChatGPT. I’m writing this paper to explain the math…
May 7

In Data Science Collective by Marcus K. Elwin
Ten Lessons from a Year Building AI Agents in LegalTech
An AI engineer’s journey optimizing legal workflows with lessons learned from building, deploying, and maintaining intelligent agents.
1d ago

In The Quantastic Journal by Rob Manson
Inside a Language Model’s Mind: Curved Inference as a New “AI Interpretability” Paradigm
New Evidence of the Shape of Thought
May 11

In Data Science Collective by Florin Andrei
Train LLMs to Talk Like You on Social Media, Using Consumer Hardware
Use your own comments on social media to fine-tune an LLM, and run all fine-tuning on (relatively) inexpensive hardware.
May 10

Jason Clark

How AI Models Fake Alignment and Why You Should Care

I remember watching “The Usual Suspects” for the first time — it’s one of those movies where you can only truly enjoy it the first time…

May 8

May Reese

When Algorithms Judge: What we Learned from Examining LLM Decision-Making in the Legal Domain

This post describes the process and results of a 72-hour research sprint project. We explore LLM performance on legal decision-making.

May 7

In

Leading EDJE

by

Matt Eland

Reference Architecture for AI Developer Productivity

May 6