Anthropic, Fable and guardrails
Tech Xplore on MSN
Mathematical proof reveals why fixed AI guardrails can never block every jailbreak
Can we make artificial intelligence impervious to adversaries who want to twist the technology to nefarious ends? Though AI is among the newest of technologies, the answer to that question is nearly a century old.
The result is correct but challenges core norms of mathematics: checking proofs, crediting ideas and keeping research open to everyone.
Democratic senators are hoping to add guardrails on the military’s AI use to an annual defense policy bill as the House Armed Services Committee prepares to debate the massive legislation on Thursday.
Researchers found open-weight AI models from Google and Meta could have guardrails removed in minutes, raising new safety concerns.
Open-weight AI models with advanced capabilities and no safeguards are becoming much more accessible. While they can be useful, AI safety experts have concerns.
AI policy groups are urging leaders on the House and Senate Armed Services Committees to add guardrails to an annual defense policy bill on the military’s use of lethal autonomous weapons. Americans for Responsible Innovation,
US President Donald Trump said he discussed guardrails on artificial intelligence with Chinese leader Xi Jinping, while adding that Nvidia Corp.’s H200 chips also came up during a two-day summit in Beijing.