Logical Thinking Performance Task

Deepseek-r1 vs OpenAI-o1 – AI Reasoning Performance Comparison

DeepSeek, a Chinese company, has introduced its DeepSeek R1 model, attracting attention for its potential to rival OpenAI’s latest offerings. Reportedly outperforming OpenAI’s o1 Preview in benchmarks ...

Geeky Gadgets

Unlock the Full Power of DeepSeek R1 by Fine-Tuning Its Reasoning Tasks

Fine-tuning a large language model (LLM) like DeepSeek R1 for reasoning tasks can significantly enhance its ability to address domain-specific challenges. DeepSeek R1, an open source alternative to ...

VentureBeat

Alibaba's new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements

Qwen Team — a division of Chinese e-commerce giant Alibaba developing its growing family of open-source Qwen large language models (LLMs) — has introduced QwQ-32B, a new 32-billion-parameter reasoning ...

EurekAlert!

Large language models demonstrate strong performance in physicians’ clinical reasoning tasks

A cutting-edge large language model (LLM) outperformed human doctors in common clinical reasoning tasks including emergency room decisions, identifying likely diagnoses, and choosing next steps in ...

Medical Xpress

AI surpasses physicians on clinical reasoning tasks, raising the bar for more serious testing

In one of the largest studies to compare artificial intelligence and physicians on a wide array of clinical reasoning tasks including real emergency department data, a team of physicians and computer ...

Medical Xpress

How the brain deploys different reasoning strategies to tackle challenging mental tasks

The human brain is very good at solving complicated problems. One reason for that is that humans can break problems apart into manageable subtasks that are easy to solve one at a time. This allows us ...

VentureBeat

Meta researchers distill System 2 thinking into LLMs, improving performance on complex reasoning

Large language models (LLMs) are very good ...

Nature

Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT

We design a battery of semantic illusions and cognitive reflection tests, aimed to elicit intuitive yet erroneous responses. We administer these tasks, traditionally used to study reasoning and ...

Nature

DR-CoT: dynamic recursive chain of thought with meta reasoning for parameter efficient models

Recent breakthroughs in natural language processing (NLP) [1,2,3] have showcased the exceptional capabilities of large language models (LLMs), including LLaMA3 [4], GPT-4 [5], and GPT-3.5 [6], in reasoning ...

Euronews

AI models rival doctors on complex medical reasoning tasks, study finds

Researchers have found that an AI model outperformed human doctors on most medical reasoning tasks, from diagnoses to patient management advice. Artificial intelligence models outperformed physicians ...

Computerworld

Microsoft introduces Phi-4, an AI model for advanced reasoning tasks

Microsoft has announced Phi-4 — a new AI model with 14 billion parameters — designed for complex reasoning tasks, including mathematics. Phi-4 excels in areas such as STEM question-answering and ...
