Latent Space — korshunov.ai

Source · Latent Space

AI Red Teaming and Prompt Injection Risks Explained

Zico Kolter and Matt Fredrikson, co-authors of the definitive paper on indirect prompt injections and authorities on the Mythos model, discuss the growing risks of AI security. They highlight that AI systems require a distinct security mindset, with agents introducing new vulnerabilities, and that specialized red-teaming AI can outperform humans in breaking models, making AI prompt injection breaches increasingly likely.

media Latent Space · 6d ago

GLM-5.2 Passes Vibe Check, Outperforms GPT-5.5

GLM-5.2 has passed a 'vibe check' as a frontier open model, receiving praise from Jeremy Howard and outperforming GPT-5.5 in Artificial Analysis' new knowledge work benchmark. It also gained validation from the /r/LocalLlama community, indicating strong real-world utility and performance.

media Latent Space · 6d ago

Why AI Scaling Is a Systems Problem, Not Just a GPU Race

The AI scaling debate overlooks that maximizing model FLOP utilization is more critical than buying more GPUs. Frontiers like xAI operate at sub-10% MFU, while historical models achieved 21% to 70% MFU, indicating systemic inefficiencies in scheduling, networking, and cluster management. Anjney Midha argues that AI infrastructure must evolve into efficient, aligned, and responsible systems, with 'output maxing' emerging as a new discipline for frontier AI.

media Latent Space · 7d ago

Radical AI Achieves 10x Acceleration in Materials Discovery

Radical AI has accelerated materials discovery by producing and characterizing 1,200 alloys in six months—nearly 10x faster than DARPA/GE MACH's goal of 500 alloys in a year. Their self-driving labs use AI scientists to generate and test hypotheses in closed-loop systems, leading to 300 new materials with 10 exhibiting novel, state-of-the-art properties now being developed for commercial use.

media Latent Space · 8d ago

GLM-5.2 Claims Top Position in Frontend Coding with Speculative Decoding

GLM-5.2, a 744B parameter model from Z.ai, has been evaluated as the top frontend coding model globally, outperforming all Opus versions including Opus 4.8. This achievement is highlighted in third-party evaluations that validate official offline tests, marking a significant milestone for a model of its size, particularly in the competitive frontend coding domain.

media Latent Space · 9d ago

Satya Nadella on Loopcraft and Frontier Ecosystems

Microsoft CEO Satya Nadella introduces 'Loopcraft' as a new theory of the firm, emphasizing that the real opportunity in AI lies not in selecting the best model, but in building learning loops that compound human and token capital. He asserts that the priority must be creating frontier ecosystems where every organization can own and grow its institutional knowledge, enabling broad value flow across industries and countries.

media Latent Space · 4d ago

Exclusive: $250 Off AI Engineer Tickets Until Monday

LS paying subscribers can access a $250 discount on AI Engineer event tickets. The offer was previously announced in AINews and is available to those who have opted in to receive AINews updates.

media Latent Space · 5d ago

Latent Space Subscribers Get $250 Discount for AIE WF 2026

Latent Space subscribers receive a limited-time $250 discount on AIE WF 2026 tickets. Attendees also receive $40k in sponsor credits from companies like Warp, Datadog, SourceGraph, Stripe, and Fireworks.

media Latent Space · 7d ago

Midjourney Launches Full-Body Ultrasound CT Scanner

Midjourney has announced a full-body ultrasound CT scanner, calling it the first new whole-body medical imaging modality in 50 years. The prototype, known as the Midjourney Scanner, uses 8,960 transducers across 40 systems in a 70 cm ring to capture data at 17 GB/s, with claimed resolution down to 0.5 mm and a goal of 358,000 ultrasonic elements. The system is currently in Gen 1, with scans taking 20 minutes and no AI used in image generation yet, though future versions aim to integrate AI and reach 50,000 scanners by enabling 1 billion scans monthly.