Top AI models fail spectacularly when faced with slightly altered medical questions

Pro@programming.dev · 6 days ago

Top AI models fail spectacularly when faced with slightly altered medical questions

panda_abyss@lemmy.ca · 6 days ago

That’s not an accurate characterization

There are LLMs trained on brute forced sets of lemmas, which then are able to predict new ones, and there are “regular” models evaluated on math that are able to create new theorems based on prompting plus their latent parameters.

Top AI models fail spectacularly when faced with slightly altered medical questions

Top AI models fail spectacularly when faced with slightly altered medical questions

Just a moment...