I'm also a technical writer, and I've been using ChatGPT to help write training materials. As you said, nothing ChatGPT (the leading LLM) does is great out of the box, but it reduces my workload substantially and is encouraged by my client. One very helpful thing is asking ChatGPT for programming examples. I write training courses for software developers, and I used to have to look up everything and spend a lot of time getting the code right. ChatGPT spits it out; I run it through a compiler. It works every time. I make adjustments to make the code easier to understand, but it makes me think that software developers are in trouble. My son is enrolled in a computer science degree program at a top university. I've advised him to focus on AI. You want to make and sell shovels, not buy and dig with them.
I find it interesting how often I see people writing and speaking about how AI can REPLACE people. I think that's the wrong way to look at this. Rather than replacing your consultants or your own work, it is ENHANCING your productivity and theirs, and saving either of you from doing what for many is the more boring aspects of knowledge work.
I don't agree. When I was in college (a millennium ago), I hired people to spell check, fix my grammar, and type up my papers. I don't need people to do that today, and no one makes money at it any longer. You may not notice it, but these things that make some lives easier make others obsolete.
Well, if you had enough $$$ even in college to hire those people, you come from a different world than most.
Most people went from doing their own typing, spell checking, and grammar to gradually becoming more productive, thanks to MS Word and Google Docs, which have automated those things for over 10 years. Even when I've had executive assistants, I never relied on them for any of that. So this is a productivity bump, not a reason to fire people.
I'm so ridiculously excited for Opus 3.5 that I'm pre-emptively worried it can't live up to my (entirely self-generated) hype.
Claude is also my most frequent flyer for anything science. Gemini does ok too for Q&A on bench techniques.
I recently got Claude to act like a statistical reviewer on a blog post with terrible NHST style thinking in evaluating LLM safety. It picked out all the issues I identified and explained them like I asked it to. Compared to last year, I think the chatbots have come a long way in being helpful for speeding up technical work.
That said, these agents also repeat a lot of nonsense and myths by default. There are too many flat-out wrong or misleading blog posts and papers on statistical thinking out there. You do have to bias them in the right direction with prompts.
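To give a concrete flavor of the NHST-style errors being discussed (this is my own toy illustration, not from the blog post in question): one of the most pervasive myths is reading p > 0.05 as "there is no effect." A quick simulation shows that an underpowered study will "miss" a perfectly real effect most of the time. All numbers here (effect size 0.3 SD, n = 20 per group, a known-variance z-test) are arbitrary choices for the sketch:

```python
import math
import random

def two_sided_p_from_z(z):
    # Two-sided p-value under a standard normal reference distribution.
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))

def power_sim(n_per_group=20, true_effect=0.3, runs=5000, seed=1):
    """Fraction of simulated two-group experiments reaching p < 0.05
    when a real effect of `true_effect` SDs exists (sigma = 1, z-test)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(runs):
        a = [rng.gauss(0.0, 1.0) for _ in range(n_per_group)]
        b = [rng.gauss(true_effect, 1.0) for _ in range(n_per_group)]
        diff = sum(b) / n_per_group - sum(a) / n_per_group
        se = math.sqrt(2.0 / n_per_group)  # known sigma=1 in both groups
        if two_sided_p_from_z(diff / se) < 0.05:
            hits += 1
    return hits / runs

# Power is only ~15%: roughly 85% of these studies report a
# "nonsignificant" result even though the effect is genuinely there.
print(round(power_sim(), 2))
```

So a nonsignificant result here says more about the sample size than about the world, which is exactly the kind of point a chatbot acting as statistical reviewer needs to be prompted to check rather than assume.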
Yup. First time around with the statistics, Claude took some shortcuts that led to quite wrong answers. I had to redirect it, and then it did ok, but it still gave answers that were slightly wrong but good enough. I'm not sure exactly where the errors occurred the second time, which would have been a problem if I hadn't had an expert reviewer.
I'm not a statistician, but I know enough statistics to know that you really don't want to outsource your statistics work. Subject-matter experts are paid handsomely for good reason on that one. Very easy to get very wrong.
True for pretty much everything I pay people for, to be honest. But yes, statistics in trials especially isn't just blindly applying formulae. Claude is ok at knowing what to do but I still would not trust it to do stats on its own for anything important.
Yeah it doesn't really know how to handle bespoke situations correctly. But with where automated data analysis is going, I expect that will start to change soon.
Even in the published literature there's a notable discrepancy between the quality of statistical review in top clinical journals and, say, the statistical review of the same study for an FDA submission. The latter is far more accurate because those reviewers often go so far as to re-analyze the data from scratch! So you have to tell Claude not to follow pervasive but wrong ways of designing and analyzing trials.
This may be a dumb question, but if you're not reading the source material, how do you know when the results that Claude gives you are incorrect? This is one of my biggest fears about using AI - that I can't trust the results.
I usually go back and check the sources.
Why Claude and not ChatGPT or Gemini (or their various other competitors)? I've been using ChatGPT Pro and loving it, but I'm thinking of switching.
Just by personal experience, Claude seems to be smarter and better at reasoning.
I've been lazy, paying for ChatGPT since the beginning, when they were the only game in town. I assumed that, as the runaway market leader, they'd be better at new features and better able to access the best resources. But my positive experience with Claude's free tier makes me wonder if Anthropic's underdog status forces them to try harder.