DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning

www.nature.com

Nature · English · 5 days ago

Nature, Published online: 17 September 2025; doi:10.1038/s41586-025-09422-z

A new artificial intelligence model, DeepSeek-R1, is introduced, demonstrating that the reasoning abilities of large language models can be incentivized through pure reinforcement learning, removing the need for human-annotated demonstrations.

From Nature via this RSS feed
