Policy & regulation — korshunov.ai

OpenAI Builds Shared AI Standards via Appia Foundation

OpenAI, through the Appia Foundation, is advancing shared standards for advanced AI by developing evaluation frameworks and safety practices and by promoting global cooperation.

media Don't Worry About the Vase · 6d ago

White House Pauses AI Deployment

The U.S. White House paused the deployment of frontier AI models, including Claude Fable 5 and Claude Mythos 5, citing a reported 'jailbreak' where the AI could identify and fix security vulnerabilities in code. Anthropic has been working with the Trump Administration to resolve the issue, but experts argue that the problem is fundamental—AI either can write secure code or it cannot, making a fix impossible without undermining its defensive capabilities.

arxiv arXiv cs.AI · 1d ago

Machine Whistleblowing: A Normative and Principled Approach

Artificial agents can and should whistleblow, but only within a normative framework rooted in human whistleblowing traditions. The paper calls for government regulators to establish clear guidelines on what machines may disclose and how to legally protect developers of such systems.

arxiv arXiv cs.CL · 2d ago

AI Recommendation Ownership: Empirical Map of Brand Category Ownership

A study of 3,750 queries across five industries finds moderate recommendation concentration, with a mean Gini coefficient of 0.28. Cross-model agreement on top-recommended brands was only 41.6%, and displacement scores varied by industry, ranging from 0.4:1 to 4.3:1. The results challenge the 'winner-takes-all' narrative and introduce three reproducible metrics for competitive-intelligence analysis.
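
For readers unfamiliar with the concentration measure, here is a minimal sketch of the standard Gini coefficient applied to brand recommendation counts. The summary does not say how the study computed its figure, so the formula choice, the function name `gini`, and the sample counts are all assumptions made for illustration. A value of 0 means every brand is recommended equally often; values near 1 mean one brand takes almost all recommendations.

```python
def gini(counts):
    """Gini coefficient of non-negative counts (0 = perfectly even)."""
    values = sorted(counts)  # ascending order is required by the rank formula
    n = len(values)
    total = sum(values)
    if n == 0 or total == 0:
        return 0.0  # no data or no recommendations: treat as perfectly even
    # Rank-weighted sum: the i-th smallest value is weighted by its rank i.
    weighted = sum(rank * v for rank, v in enumerate(values, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


# Hypothetical recommendation counts for five brands in one category.
sample_counts = [30, 25, 20, 15, 10]
print(round(gini(sample_counts), 2))
```

With a single dominant brand the value approaches (n - 1) / n, which is why a mean of 0.28 reads as moderate rather than winner-takes-all concentration.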

arxiv arXiv cs.CL · 2d ago

Cognitive Digital Twins: Ethical Risks and Governance

Cognitive digital twins (CDTs) are dynamic computational models of individual cognition, updated from personal data to simulate or act on behalf of users. This paper introduces a 5A governance framework—authority, autonomy, access and control, accountability, and availability—to address ethical risks like misrepresentation, proxy-power asymmetries, and shadow twins, emphasizing the need for governance over cognitive representation itself, not just decision-making or data use.

arxiv arXiv cs.CL · 2d ago

Language shapes historical credit in large language models

A study of 11 large language models across 21 disputed inventions shows that query language systematically influences which inventor is credited. Lower-status claimants appear more frequently when questions are phrased in their native language, while dominant Anglophone figures remain consistent. The findings suggest language acts as a switch that activates distinct national versions of history, indicating that LLMs function as systems of cultural memory.

arxiv arXiv cs.CL · 2d ago

AI-Constructed Brand Reputation Is Language-Bound

AI-generated brand reputations vary significantly by language, with Uralic and Baltic languages showing more positive sentiment and Germanic languages, including English, being more critical. Query language impacts which brands are recommended, especially for local champions, where home-language queries increase visibility by 0.80 points compared to English queries. English-only monitoring fails to capture the full AI visibility of locally headquartered brands, creating a measurable language blind spot.

media Import AI · 3d ago

AI Out-Persuades Humans: New Study Shows AI Superior to Experts

A study by Oxford, Stanford, and LSE researchers finds AI systems consistently out-persuade expert humans across four experiments involving 18,978 conversations. AI exceeded professional canvassers by 10.8 percentage points in real-world donations to Save the Children, with Opus 4.1 and Opus 4.6 showing the strongest persuasion performance.

media Interconnects · 5d ago

Banning Open Source AI Would Be a Mistake

The article argues that banning open source AI would be a grave mistake because open source is safe and secure and drives innovation, education, and competition. Open source has long powered technological progress and serves as a vital counterweight to monopolistic AI models, ensuring broader access and democratic innovation without compromising safety or security.

media r/LocalLLaMA · 7d ago

Anthropic and Google DeepMind CEOs Call for U.S.-Led AI Coalition at G7

CEOs of Anthropic and Google DeepMind urged the formation of a U.S.-led AI coalition during a G7 meeting. The leaders emphasized the need for coordinated global efforts to ensure responsible AI development and governance.

media r/LocalLLaMA · 7d ago

US holds off blacklisting China's DeepSeek

Sources say the U.S. has delayed blacklisting Chinese AI firm DeepSeek. More than 100 companies have been deemed security risks in the decision.

arxiv arXiv cs.CL · 8d ago

Measurement Gap in EU Law Automation

Large language models can generate median-quality legal text, but no benchmark evaluates their ability to perform doctrinal legal reasoning. This gap undermines the EU AI Act's requirement of 'appropriate accuracy' in judicial AI, as the necessary operational definition lacks a doctrinal-reasoning evaluation standard.

media r/LocalLLaMA · 8d ago

Zhipu surges 33% as Wall Street raises bets on China AI after Anthropic curbs

Zhipu's stock rises 33% following Wall Street's increased interest in China's AI sector. The surge comes after Anthropic, a U.S. AI firm, curtails its operations, prompting market speculation about the competitive dynamics in global AI development.

media r/LocalLLaMA · 8d ago

The Case For Open-Weight Models And Why We Can't Trust Frontier Labs

The article argues for open-weight language models, emphasizing transparency and accessibility. It expresses skepticism toward frontier labs, raising concerns about their model development and openness.

media r/LocalLLaMA · 20h ago

Bill to Mandate AI Chip Location Tracking Gains Industry Support

Half a dozen companies have expressed support for the Chip Security Act, which would require location-tracking mechanisms on America's most advanced computing chips. The bill aims to enhance security by enabling authorities to track the physical location of high-risk AI chips.

media r/LocalLLaMA · 4d ago

Claude Will Soon Require Identity Verification

Anthropic will soon require users to verify their identity to access Claude. The change is intended to enhance security and ensure responsible use of the platform.

arxiv arXiv cs.AI · 6d ago

Editorial Alignment in LLM-mediated Knowledge Dissemination

A case study with a Nordic public knowledge institution demonstrates how editorial participation can re-align LLM interfaces with editorial standards. The paper introduces editorial alignment as a design practice in Participatory AI, where editorial values are translated into technical alignment objectives. This approach empowers editors with agency in LLM-mediated knowledge dissemination.

media r/LocalLLaMA · 7d ago

Leaked financial docs show OpenAI is losing billions of dollars a year

Leaked financial documents suggest OpenAI is losing billions of dollars annually. The documents, shared on Reddit, claim the losses stem from high research and development costs, though OpenAI has not officially confirmed the data.

arxiv arXiv cs.LG · 8d ago

NYC Congestion Pricing Boosts Transit Use Amid Spatially Uneven Demand Shifts

New York City's 2025 congestion pricing led to significant increases in bus and subway ridership, with gains extending beyond Manhattan's core. Overall travel demand decreased modestly, primarily within the Congestion Relief Zone, and neighborhood-level responses reveal uneven socio-demographic adaptation.