Byron Crowe (@byrondcrowe) Twitter Tweets • TwiCopy

Byron Crowe

@byrondcrowe

CMO at Solera Health. Academic hospitalist @BIDMChealth @HarvardMed. I tweet about the digital transformation of medicine.

ID:786054098

Joined: 28-08-2012 03:02:12

403 Tweets

229 Followers

192 Following

Byron Crowe

4 weeks ago

This is one of my favorite LLM studies of the year. Not only because it showed that GPT-4 outperforms physicians on these summary tasks (it is cool to see that parity), but because it showed how often humans and LLMs still make mistakes in summarization (~5-10% frequency)!

Likes: 2

Ethan Mollick

1 month ago

In every group I speak to, from business executives to scientists, including a group of very accomplished people in Silicon Valley last night, much less than 20% of the crowd has even tried a GPT-4 class model.

Less than 5% has spent the required 10 hours to know how they tick.

Likes: 1.8K

david rein

2 months ago

Claude 3 gets ~60% accuracy on GPQA. It's hard for me to overstate how hard these questions are: literal PhDs (in different domains from the questions) with access to the internet get 34%.

PhDs *in the same domain* (also with internet access!) get 65% - 75% accuracy.

Likes: 1.5K

Ethan Mollick

2 months ago

This whole piece by computer scientist Scott Aaronson is interesting.

But I think these sections are helpful because they capture the level of surprise I often see among experts about how well LLMs work in practice & the uncertainty over what comes next. scottaaronson.blog

Likes: 400

Min Choi

2 months ago

Sora by OpenAI is insane.

But it doesn't just generate AI videos from text, it can also change the styles and environments of input videos🤯

12 wild examples:

First, Input video 🧵👇

Likes: 12K

Byron Crowe

2 months ago

The finding that 54% of medical students in this global survey viewed medical training as a stepping stone to a non-clinical career is WILD.

There’s nothing wrong with wanting to take that path, but wow - that’s way more than I would have anticipated.

Likes: 1

Eric Topol

2 months ago

Use of #AI synthetic notes >300,000 clinic conversations, >3,400 PermanenteDoctors catalyst.nejm.org/doi/full/10.10…
—81% Physicians-less screen time during visits and less 'pajama time' work on EHR
—71% Patients-more time spent speaking w/ physicians
—Audit of random 35 notes-high quality

Likes: 234

Rebecca Mitchell, MD

@RebeccaCoelius

3 months ago

Halle Tecco MBA, MPH Sergei Polevikov Just this year through VC backed digital health:

1. Saved $300 on a prescription with GoodRx
2. Was FINALLY properly diagnosed with early menopause and experienced life altering symptom improvement on low dose estrogen through Midi
3. Using DTC lab ordering and a lipidologist I…

Likes: 75

Jonathan H Chen MD PhD

3 months ago

#AI's black box dissuades many in high-stakes (medical) applications. With purposeful prompting methods, large language models can mimic clinical reasoning processes on their way to delivering answers. Tom Savage, Ashwin Nayak, Rob Gallo, Ekanath Rangan
nature.com/articles/s4174…

Likes: 52

Mario Schlosser

3 months ago

Data from the Oscar Medical Group's AI-written messaging summaries: the time it takes to document visits to virtual care providers via secure messages dropped by around 30% with automated summaries. The chart below shows how providers are using the AI-written summaries.…

Likes: 61

Isaac Kohane

3 months ago

Evidence of AI accelerating AI: AMIE from @google is a tour de force for the AIM community bit.ly/3tTVLhE Most striking is the multiple places in which LLMs are used to critique/augment/fine-tune other LLMs, bypassing hugely expensive human assessments PRIOR to RCT.

Likes: 59

Katharina Schmack

4 months ago

Inspired by recent posts, here is my Christmas gift for you: the worst postdoc interview ever (without gifs because I am old but not that old).

Likes: 553

Mario Schlosser

4 months ago

Large-scale AI models are a once-in-a-generation opportunity to improve healthcare. 28 of the most forward-thinking payers and providers got together to figure out how we leverage frontier AI models to drive the change we want to see in healthcare. Here are our commitments:…

Likes: 157

Nick Mark MD

4 months ago

Folks don’t give epi or vasopressin for someone “going into Afib.” 😂

Likes: 265

Byron Crowe

4 months ago

I loved seeing this paper expanding on our prior work evaluating LLM diagnostic performance, and especially appreciated that the Google team gave me and Zahir Kanjee MD, MPH, FACP a shout out as the amorphous “human raters” who evaluated GPT-4’s ddx.

Alan Karthikesalingam call us anytime 😂

Likes: 4

MatthewBerman

4 months ago

Ever wonder what an LLM actually looks like under the hood?

This visualization is mesmerizing.

Check it out yourself: bbycroft.net/llm

Likes: 1.2K

Emily Moin

4 months ago

Please show this to anyone who says stuff like “because of Ozempic there won’t be overweight people anymore.” We’ve had plenty of miracle drugs before and can still barely get people to take them!

Likes: 32

Adam Rodman

5 months ago

Very cool recreation (and improvement) of Zahir Kanjee MD, MPH, FACP, Byron Crowe, and my study on GPT-4 and NEJM CPCs! (but with a diagnostically fine-tuned PaLM-2)

Likes: 18
