Hackers News7B Model and 8K Examples: Efficient and Effective Emerging Reasoning with RL adminJanuary 25, 202501 mins Comments Post navigation Previous: The South Vietnamese Pilot Who Performed a Daring Feat To Save His FamilyNext: DeepSeek R1’s bold bet on reinforcement learning: How it outpaced OpenAI at 3% of the cost Leave a ReplyYou must be logged in to post a comment.
The one change that worked: I set my phone to ‘do not disturb’ three years ago – and have never looked back | Health & wellbeing February 16, 2025