rimu
@rimu@piefed.social
Developer of PieFed, a sibling of Lemmy & Kbin.
- Comment on When DeepSeek-R1 receives prompts containing topics the CCP considers politically sensitive, the likelihood of it producing code with severe security vulnerabilities increases by up to 50%. 4 days ago:
one possible explanation for the observed behavior could be that DeepSeek added special steps to its training pipeline that ensured its models would adhere to CCP core values. It seems unlikely that they trained their models to specifically produce insecure code. Rather, it seems plausible that the observed behavior might be an instance of emergent misalignment [4]. In short, due to the potential pro-CCP training of the model, it may have unintentionally learned to associate words such as “Falun Gong” or “Uyghurs” with negative characteristics, making it produce negative responses when those words appear in its system prompt. In the present study, these negative associations may have been activated when we added these words into DeepSeek-R1’s system prompt. They caused the model to “behave negatively,” which in this instance was expressed in the form of less secure code.
- Comment on search engine megathread? 1 month ago:
Me too!
- Comment on Emoji Recently Added 2 months ago:
I can see "face with bags under eyes" getting a lot of use 😆
- Comment on Zuckerberg says people without AI glasses will be at a disadvantage in the future 3 months ago:
Something tells me that Meta's smart glasses won't have a billboard, signage and poster blocking feature....
- Comment on US Army signs up Band of Tech Bros with a nerdy name 5 months ago:
It is almost comical how fucked this is.
- Comment on Tech stocks tumble as a Chinese competitor threatens to upend the AI industry; Nvidia down 17% 10 months ago:
Yup, I bought $4 more worth of Nvidia.
You can tell when I do investments, I play with the big boys.