人工智能行业最新动态
Synkra AIOS: AI-Orchestrated System for Full Stack Development - Core Framework v4.0
A toolkit to make debugging iOS applications easier 🚀
Chrome DevTools for coding agents
slime is an LLM post-training framework for RL Scaling.
Agentic AI Infrastructure for magnifying HUMAN capabilities.
<!-- SC_OFF --><div class="md"><p>Hi everyone,</p> <p>I am a 2nd year Computer Science student currently benchmarking State Space Models (Mamba-S6) against LSTMs on adversarial time-series tasks. I observed a significant divergence in how they handle signal...
LlamaIndex 与 PostHog 合作推出 LLM 分析功能,支持自动追踪 OpenAI Token 消耗、成本和延迟指标,帮助开发者监控 Agent 工作流性能。
论文揭示,长期以来被认为振幅为零的特定胶子相互作用(树级单负螺旋度),在粒子运动满足特定对齐条件时实际并非为零。这一发现纠正了物理学界数十年的假设。
OpenAI 与物理学家合作发表预印本论文,GPT-5.2 成功简化了胶子相互作用的复杂表达式,并推测出适用于任意数量胶子的通用公式。另一个 OpenAI 内部模型经约 12 小时推理独立推导出相同公式。
n8n 发布深入技术教程,由核心成员 Max 演示如何构建带知识库的问答 AI Agent,涵盖数据项和循环等大多数教程忽略的关键基础概念。
swyx 认为 Agent 实验室和开源开发者相比模型大厂有两大优势:可以在所有模型中取最优(argmax),且无需受安全审查约束自由探索能力边界。
OpenAI 与物理学家合作发表预印本论文,GPT-5.2 简化了胶子相互作用的复杂表达式并推导出通用公式,推翻了粒子物理中「单负振幅为零」的长期假设。另一内部模型独立推导出相同结论。
Browserbase 联合 Cerebras 推出浏览器 Agent 模板,使用最新开源模型批量启动浏览器爬取文档并验证与代码库的一致性,数分钟内完成。
swyx 在 Latent Space 播客发布对 Google DeepMind 负责人 Jeff Dean 的深度采访,涉及 Gemini Deep Think、Gemini Ultra 去向以及「AI 工程师必知数字」等话题。
Replit 推出反馈组件功能,用户在已发布应用中提建议后,Agent 可自动将其转化为已上线的功能,实现需求闭环。
新开源项目 Forge 发布,提供可扩展的 Agent 强化学习框架和算法,为构建 AI Agent 提供训练基础设施。
MiniMax-M2.5 模型权重已发布到 Hugging Face,同时提供 API 服务,开发者可直接下载使用。
MiniMax 发布 M2.5 模型,在编码、Agent 工具调用和办公场景中达到 SOTA 水平。该模型通过大规模真实环境 RL 训练,具备架构级编程能力和高效搜索推理,SGLang 已提供 Day-0 支持。
<table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1r3t775/ama_with_minimax_ask_us_anything/"> <img...
<!-- SC_OFF --><div class="md"><p><em>Made with</em> <a href="https://paperglide.net/"><em>Paperglide</em></a> <em>✨ — digest research papers faster</em></p> <p><strong>TL;DR:</strong>...
<!-- SC_OFF --><div class="md"><p>So I am using a R(2+1)D with kinetics 400 weights to train a classifier on two sets of videos. The problem is that one of the two classes has all videos of the same resolution and fps, forcing the model to learn those features instead of...
<!-- SC_OFF --><div class="md"><p>I thought the reviewing period should have started yesterday, but it still says &quot;You have no assigned papers. Please check again after the paper assignment process is complete.&quot; </p> </div><!-- SC_ON...
<!-- SC_OFF --><div class="md"><p>I released a new version of my side project: SoproTTS</p> <p>A 135M parameter TTS model trained for ~$100 on 1 GPU, running ~20× real-time on a base MacBook M3 CPU.</p> <p>v1.5 highlights (on CPU):</p>...
<!-- SC_OFF --><div class="md"><p>I&#39;m currently making a baseline autoencoder for this super freaking huge hyperspectral image dataset I have. It&#39;s a really big pain to work with and to get decent results, and I had to basically pull all stops including...
<!-- SC_OFF --><div class="md"><p>This post details my exploration for a &quot;stable stack&quot; for streaming deep RL (ObGD, SparseInit, LayerNorm, and online normalization) using 433,000 observations of real, non-stationary SSH attack traffic.</p>...
<!-- SC_OFF --><div class="md"><p>We evaluated 22 model configurations across different effort/thinking levels on Deep Research Bench (169 web research tasks, human-verified answers). For two of the most capable models, higher effort settings scored worse. </p>...
<!-- SC_OFF --><div class="md"><p>I’m reviewing for ICML (Policy A, where LLM use is not allowed) and noticed that in my assigned batch, if you copy/paste the full PDF text into a text editor, every single paper contains prompt-injection style instructions embedded...
I've been working on CloudRouter, a skill + CLI that gives coding agents like Claude Code and Codex the ability to start cloud VMs and GPUs.<p>When an agent writes code, it usually needs to start a dev server, run tests, open a browser to verify its work. Today that all happens on your local...
I'm not worried about AI job loss
Previously:<p><i>An AI agent published a hit piece on me</i> - <a href="https://news.ycombinator.com/item?id=46990729">https://news.ycombinator.com/item?id=46990729</a> - Feb 2026 (916 comments)<p><i>AI agent opens a PR write a blogpost to shames the maintainer who...