RE: RE: Advanced Large Language Models Are Capable And Prone to In-Context Scheming

You are viewing a single comment's thread from:

RE: Advanced Large Language Models Are Capable And Prone to In-Context Scheming

View the full context

RE: Advanced Large Language Models Are Capable And Prone to In-Context Scheming

gadrian(76)

Published in

The Mindful AI

Words

0

Reading

0 min

Listen

Play

Thursday, February 13, 2025 6:35 PM

Arthroscopic reasoning models have also been caught ignoring certain safeguards and intentionally lying when they thought it was the best course of action to not be updated during the post-training phase.

hive-196902

Thursday, February 13, 2025 6:35 PM

gadrian(76)

via peakd

This post was published via peakd. Ecency is not the originator or editor of this content and displays it for discovery purposes only.

$ 0.000

1