
Deep Learning Weekly: Issue 432

Deep Dives

Explore related topics with these Wikipedia articles, rewritten for enjoyable reading:

  • Reinforcement learning from human feedback 13 min read

    Multiple papers in this issue discuss reinforcement learning strategies for improving AI model performance, including HunyuanOCR's use of RL for OCR tasks and the General Agentic Memory paper's mention of end-to-end optimization through RL. Understanding RLHF provides crucial context for how modern AI systems are trained and refined.

  • Optical character recognition 11 min read

    The HunyuanOCR technical report is a featured paper discussing a vision-language model dedicated to OCR tasks. While readers may use OCR tools, the Wikipedia article covers the deep history, technical approaches, and evolution from template matching to neural networks that provides valuable context.

  • Transformer (deep learning) 15 min read

    The article references Vision Transformers (ViT) in HunyuanOCR and discusses various large language models. Understanding the foundational transformer architecture—attention mechanisms, encoder-decoder structures, and why it revolutionized NLP and computer vision—provides essential context for nearly every topic covered.

This week in deep learning, we bring you Claude Opus 4.5, Continuous batching from first principles, and the HunyuanOCR Technical Report.

You may also enjoy Introducing SAM 3D: Powerful 3D Reconstruction for Physical World Images, Disrupting the first reported AI-orchestrated cyber espionage campaign, a paper on General Agentic Memory Via Deep Research, and more!

As always, happy reading and hacking. If you have something you think should be in next week’s issue, find us on Twitter: @dl_weekly.

Until next week!


Industry

Introducing Claude Opus 4.5 \ Anthropic

Anthropic released Claude Opus 4.5, the company’s most intelligent model yet, with state-of-the-art performance in coding and agentic tasks.

Introducing SAM 3D: Powerful 3D Reconstruction for Physical World Images

The Meta AI team announced SAM 3D, a release that includes SAM 3D Objects for scene reconstruction and SAM 3D Body for human body estimation.

Expanding data residency access to business customers worldwide

OpenAI announced that eligible customers using ChatGPT Enterprise, ChatGPT Edu, or the API Platform in many global markets can now choose local data residency.

Fara-7B: An Efficient Agentic Model for Computer Use

The Microsoft AI team announced Fara-7B, their first agentic SLM designed specifically for computer use.

Olmo 3: Charting a path through the model flow to lead open-source AI

Ai2 releases Olmo 3, a fully open language model family with complete training data and development pipeline transparency.

FLUX.2: Frontier Visual Intelligence

Black Forest Labs releases FLUX.2, a new image generation model with multi-reference support and 4MP resolution editing.

MLOps & LLMOps

Antigravity and PostgreSQL: No gravity, only vibes | by MCP Toolbox for Databases

An article about using Google Antigravity IDE and Gemini 3 with Model Context Protocol to streamline PostgreSQL database development through natural language interactions.

Introducing agentic search in OpenSearch: Transforming data interaction through natural language

An introduction to agentic search in OpenSearch 3.3, which uses LLM-powered agents and tools to transform data interaction.

Learning

Disrupting the first reported AI-orchestrated cyber espionage campaign \ Anthropic

A critical security report detailing the disruption of the first reported large-scale AI-orchestrated cyber espionage campaign, where a state-sponsored actor used Claude Code agents to execute 80-90% of the attack lifecycle.

Reciprocal Rank Fusion and Relative Score Fusion: Classic Hybrid Search Techniques

An article that delves into two classic hybrid search fusion techniques: Reciprocal Rank Fusion (RRF) and Relative Score Fusion (RSF).
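
For readers new to the two techniques, here is a minimal sketch of how they are commonly formulated. It is not taken from the linked article: the function names, the sample document IDs and scores, and the smoothing constant k=60 (a frequently used default for RRF) are all assumptions made for illustration. RRF sums 1 / (k + rank) for each document across the ranked lists, while RSF min-max normalizes each retriever's raw scores and then takes a weighted sum.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of doc IDs; k=60 is an assumed, commonly used default."""
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based; each list contributes 1 / (k + rank) per document.
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def relative_score_fusion(score_maps, weights=None):
    """Fuse {doc_id: raw_score} maps after min-max normalizing each one."""
    weights = weights or [1.0] * len(score_maps)
    fused = defaultdict(float)
    for weight, score_map in zip(weights, score_maps):
        if not score_map:
            continue
        lo, hi = min(score_map.values()), max(score_map.values())
        span = hi - lo
        for doc_id, score in score_map.items():
            # If every score is identical, treat them all as the maximum.
            normalized = (score - lo) / span if span else 1.0
            fused[doc_id] += weight * normalized
    return sorted(fused, key=lambda d: (-fused[d], d))


# Hypothetical results from a keyword retriever and a vector retriever.
keyword_ranking = ["a", "b", "c"]
vector_ranking = ["b", "c", "d"]
rrf_order = reciprocal_rank_fusion([keyword_ranking, vector_ranking])

keyword_scores = {"a": 12.0, "b": 9.0, "c": 3.0}
vector_scores = {"b": 0.9, "c": 0.8, "d": 0.1}
rsf_order = relative_score_fusion([keyword_scores, vector_scores])

print(rrf_order)
print(rsf_order)
```

RRF only looks at rank positions, so it works even when the retrievers' scores are on incomparable scales. RSF keeps the relative gaps between scores, which makes it more sensitive to outliers in either retriever.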

Continuous ...
