The Obtain: Understanding deep matter, and AI jailbreak safety

We are able to put an excellent determine on how a lot we all know in regards to the universe: 5%. That’s how a lot of what’s floating about within the cosmos is odd matter—planets and stars and galaxies and the mud and gasoline between them. The opposite 95% is darkish matter and darkish power, two mysterious entities aptly named for our incapacity to make clear their true nature.

Earlier work has begun pulling aside these dueling forces, however darkish matter and darkish power stay shrouded in a blanket of questions—critically, what precisely are they?

Enter the Vera C. Rubin Observatory, one in every of our 10 breakthrough applied sciences for 2025. Boasting the most important digital digicam ever created, Rubin is predicted to review the cosmos within the highest decision but as soon as it begins observations later this yr. And with a greater window on the cosmic battle between darkish matter and darkish power, Rubin would possibly slim down present theories on what they’re fabricated from. Right here’s a take a look at how.

—Jenna Ahart

This story is a part of MIT Expertise Assessment Explains, our collection untangling the advanced, messy world of know-how that can assist you perceive what’s coming subsequent. You may learn extra from the collection right here.

Anthropic has a brand new option to shield massive language fashions in opposition to jailbreaks

What’s new? AI agency Anthropic has developed a brand new line of protection in opposition to a typical form of assault referred to as a jailbreak. A jailbreak methods massive language fashions (LLMs) into doing one thing they’ve been skilled to not, comparable to assist anyone create a weapon. And Anthropic’s new method could possibly be the strongest defend in opposition to the assaults but.

How they did it: Jailbreaks are a form of adversarial assault: enter handed to a mannequin that makes it produce an surprising output. Regardless of a decade of analysis there’s nonetheless no option to construct a mannequin that isn’t weak. However, as an alternative of attempting to repair its fashions, Anthropic has developed a barrier that stops tried jailbreaks from getting via and undesirable responses from the mannequin getting out. Learn the complete story.