Alibaba's Qwen Team Makes AI Models Think Deeper with New Algorithm
Alibaba's Qwen team has developed a new reinforcement learning algorithm that lets AI models reason at greater length by assigning a different reward to each step of the reasoning process. The standard approach rewards every token equally, which has been shown to limit how long a model's chain of thought can grow. The new algorithm instead weights each step by its impact on the steps that follow, which effectively doubles the length of the model's reasoning. The algorithm has shown promising results in preliminary testing and is expected to be integrated into Alibaba's existing AI platforms. The Qwen team's work could improve how AI models perform in real-world applications, with potential advances in areas such as natural language processing and computer vision.
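The contrast between equal per-token rewards and step-weighted rewards can be illustrated with a minimal Python sketch. This is not the Qwen team's published method: the article does not give the formula, so the `influence` scores, the function names, and the normalization (weights averaging 1, so total credit matches the uniform baseline) are all assumptions made for illustration.

```python
from typing import List


def uniform_credit(advantage: float, num_tokens: int) -> List[float]:
    """Baseline: every token receives the same sequence-level advantage."""
    return [advantage] * num_tokens


def weighted_credit(advantage: float, influence: List[float]) -> List[float]:
    """Scale the advantage for each token by a hypothetical influence score.

    `influence[i]` stands in for "how much step i affects later steps";
    how such a score would be computed is not specified in the article.
    Weights are normalized to average 1, so the total credit handed out
    equals that of the uniform baseline.
    """
    if not influence:
        return []
    if any(score < 0 for score in influence):
        raise ValueError("influence scores must be non-negative")
    total = sum(influence)
    if total == 0:
        # No step stands out: fall back to equal credit.
        return uniform_credit(advantage, len(influence))
    n = len(influence)
    return [advantage * score * n / total for score in influence]


if __name__ == "__main__":
    # Two tokens; the second is assumed to matter three times as much.
    print(uniform_credit(1.0, 2))
    print(weighted_credit(1.0, [1.0, 3.0]))
```

Under these assumptions, a high-influence step receives a larger share of the same overall reward, while a step with little downstream effect receives less.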