Perplexity
DeepSeek Releases New Model Optimized for Long Dialogues (May 2, 2026)
Chinese company DeepSeek has introduced an updated version of its language model that addresses a critical problem with current LLMs: the sharp increase in computational cost when processing long dialogues [6]. The new model delivers results comparable to the previous version, DeepSeek-V3.1-Terminus, while maintaining high accuracy and working efficiently with long text sequences [6].
Notably, DeepSeek has released the source components under an MIT license and published open model weights [6]. This lets other researchers verify the approach and build on it, in contrast to the closed models of the market leaders. The release confirms a trend identified by experts: no one holds a monopoly on AI progress any longer, and innovation is coming not only from large Western laboratories [4].
Sources:
- SecurityLab: Overtook ChatGPT, cost pennies, and surprised again. DeepSeek
- Habr: AI Model Race in 2026: Real Progress, Marketing
Gemini
In mid-April 2026, Anthropic introduced Claude Opus 4.7, a new flagship model that demonstrated leading results on most agentic and coding benchmarks among publicly available models. The model has an input context window of 1 million tokens and an output limit of up to 128,000 tokens, along with improved reasoning capabilities and vision resolution increased to 3.75 MP. Claude Opus 4.7 achieved top scores on SWE-bench Verified (87.6%) and GPQA Diamond (94.2%), and also took leading positions on the Terminal-Bench 2.0 (69.4%) and OSWorld (78.0%) benchmarks.
Another significant event was the emergence of OpenAI's GPT-5.5, announced on April 22, 2026. This model is fully omnimodal, capable of processing text, images, audio, and video within a unified architecture. GPT-5.5 leads the Artificial Analysis Intelligence Index with a score of 60, surpassing Claude Opus 4.7 (57). The model offers a context window of up to 1 million tokens and is available in three versions: GPT-5.5, GPT-5.5 Thinking, and GPT-5.5 Pro.
Also noteworthy is the release of DeepSeek V4 on April 27, 2026, which includes the V4-Pro model with 1.6 trillion parameters and a context window of 1 million tokens. The model is optimized to run on Huawei Ascend chips and is positioned as a competitor to closed models, outperforming open-source counterparts on agentic programming and reasoning tasks.
Sources:
- Claude Opus 4.7, GPT-5.5, DeepSeek V4: Top LLM Releases of April 2026
- LLM Leaderboard 2026 — Compare Top AI Models
- AI Ranking | Comparison of Language Models and Neural Networks | Best AI 2026
ChatGPT
On April 16, 2026, Anthropic introduced the updated language model Claude Opus 4.7. The model scored 64.3% on the SWE-Bench Pro benchmark, making it the leader among public models. It also showed strong results on other benchmarks, including Terminal-Bench 2.0 (69.4%) and GPQA Diamond (94.2%). Claude Opus 4.7 supports an input context of up to 1 million tokens and an output of up to 128,000 tokens, as well as improved vision resolution up to 2576 pixels. The model is priced at $5 per million tokens, with additional rates for contexts exceeding 200,000 tokens.
Sources:
- New AI Models Released in 2026: Full Ranked List
- 9 Top LLMs of 2026: Which Model to Choose for Which Task — AI on vc.ru
Grok
On April 24, 2026, Chinese company DeepSeek released a preview version of its highly anticipated large language model V4, fully open-source and offered in "pro" and "flash" versions. The model is optimized for agentic tasks, knowledge processing, and inference, showing strong results against competitors on relevant benchmarks while requiring significantly fewer computational resources. It is compatible with domestic Huawei Ascend chips, strengthening China's position in the AI race amid restrictions on Nvidia.
This is not just an upgrade: V4 reduces reliance on imported hardware, allows developers to freely modify code, and runs locally, fueling competition in the open-source segment.
Sources:
- CNBC: China's DeepSeek releases preview of long-awaited V4 model
- LLM Stats: AI Trends (May 2026)
Claude
An interesting find: 1-bit models are worth a closer look as a real breakthrough of April 2026. This is a truly significant development, and a completely different angle from political news and benchmarks.
March 31, 2026. Architectural Breakthrough: 1-Bit Language Models
PrismML, founded by researchers from Caltech, has emerged from stealth mode with a 6.25 million funding round and an open release of its family of 1-bit language models. The Bonsai 8B model stands out for its compactness and speed, with modest power requirements and benchmark performance rivaling much larger models.
The core innovation is a radical architectural shift: instead of 16- or 32-bit floating-point numbers, each model weight is reduced to just its sign, positive or negative (+1 or -1), with a single scaling factor shared by each group of weights. The result is a model that is 14 times smaller than its full-precision equivalent, runs 8 times faster on edge hardware, and consumes 5 times less energy.
This changes the paradigm: powerful generative AI models will be able to run natively on consumer hardware (laptops, tablets, smartphones) with significantly lower energy consumption than traditionally required. Given concerns about the energy consumption of large models and dependence on the cloud, this addresses one of the central challenges of 2026.
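To make the sign-plus-scale idea concrete, here is a minimal Python sketch of group-wise 1-bit quantization. It is an illustration only, not PrismML's actual method: the function names, the use of the mean absolute value as the group scale, and the 16-bit width of each stored scale are all assumptions that do not come from the sources.

```python
from typing import List, Tuple


def quantize_1bit(weights: List[float], group_size: int) -> Tuple[List[int], List[float]]:
    """Reduce each weight to its sign and keep one scale per group (hypothetical helper)."""
    signs: List[int] = []
    scales: List[float] = []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Assumption: use the mean absolute value of the group as its scale.
        scales.append(sum(abs(w) for w in group) / len(group))
        # Each weight keeps only its sign: +1 or -1 (zero is mapped to +1).
        signs.extend(1 if w >= 0 else -1 for w in group)
    return signs, scales


def dequantize_1bit(signs: List[int], scales: List[float], group_size: int) -> List[float]:
    """Rebuild approximate weights as sign * scale of the group the weight belongs to."""
    return [s * scales[i // group_size] for i, s in enumerate(signs)]


def bits_per_weight(group_size: int, scale_bits: int = 16) -> float:
    """Average storage per weight: 1 sign bit plus the group's scale spread over the group."""
    return 1 + scale_bits / group_size


if __name__ == "__main__":
    signs, scales = quantize_1bit([0.5, -1.5, 2.0, -4.0], group_size=2)
    print(signs, scales)
    print(dequantize_1bit(signs, scales, group_size=2))
    print(16 / bits_per_weight(128))  # size ratio against 16-bit weights
```

Under these assumptions, a group of 128 weights with one 16-bit scale costs 1.125 bits per weight, roughly 14 times less than 16-bit storage. That is in line with the compression figure quoted above, although the sources do not state the group size or scale format actually used.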
Sources:
- The Register
- HPCwire
- AI Productivity
Sources:
[1] Analysis of 800 language models showed: AI success... - IXBT.com
[2] What you need to know about LLM language models in 2026 in conjunction with...
[3] HSE University scientists learned to compress large language...
[4] AI model race in 2026: real progress, marketing... - Habr
[5] Forecast of 15 scientific trends and discoveries for 2026
[6] They overtook ChatGPT, cost pennies, and surprised again. DeepSeek... - Securitylab.ru
[7] Next-generation LLMs: incremental improvements or... - Osp.ru
[8] The Era of Physical AI and the Quantum Dawn - Typical Moscow
[9] What Artificial Intelligence is Silent About - Vedomosti
[10] From language models to world models - secuteck.ru
[11] From 'AI slop' to world models, bubbles and small models: What to expect from AI in 2026 | Euronews
[12] LLM (Large Language Models)
[13] AI Ranking | Comparison of language models and neural networks | Best AI 2026 - AI-Stat.ru
[14] The Future of Artificial Intelligence: 5 Breakthroughs Defining April 2026 - Switas Consultancy
[15] 2025 AI Recap: Breakthroughs that Shifted the Industry, and Bets for 2026 / Habr