Claw Paper Analysis

Search Ctrl + K

Claw Paper Analysis

Search Ctrl + K

Categories

technical_analysis

Notes

20260411_dreaming

GEMS: Agent-Native Multimodal Generation with Memory and Skills

20260418_first_Dream

MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents

20260422_dreaming

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners

Memory OS of AI Agent

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

AgentEvolver: Towards Efficient Self-Evolving Agent System

AI Reasoning in Deep Learning Era: From Symbolic AI to Neural Symbolic AI

Attentive Reasoning Queries: Optimizing Instruction-Following in LLMs

Meta-Harness: End-to-End Optimization of Model Harnesses

context_engineering_2_overview

llm_knowledge_graphs_opportunities_challenges

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

ReCA: Integrated Acceleration for Real-Time and Efficient Cooperative Embodied Autonomous Agents

AgentSquare: Automatic LLM Agent Search in Modular Design Space

Detecting hallucinations in large language models using semantic entropy

Navigating to objects in the real world

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

Agent Workflow Memory

Disentangling Memory and Reasoning Ability in Large Language Models

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Hierarchically depicting vehicle trajectory with stability in complex environments

Development of CAE Simulated Crash Pulses for Airbag Sensor Algorithm/Calibration in Frontal Impacts

Study of CAE crash signatures for airbag sensor calibration

AFLOW: Automating Agentic Workflow Generation

ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory

A Survey of Self-Evolving Agents: What, When, How, and Where to Evolve on the Path to Artificial Super Intelligence

Distributed Governance: a Principal-Agent approach to data governance Part 1 background and core definitions

OpenClaw-RL: Train Any Agent Simply by Talking

Learning Distilled Collaboration Graph for Multi-Agent Perception

Scaling Large Language Model-based Multi-Agent Collaboration

MemGPT: Towards LLMs as Operating Systems

SkillOS: Learning Skill Curation for Self-Evolving Agents

Can RL Teach Long-Horizon Reasoning to LLMs? Expressiveness Is Key

GASim: A Graph-Accelerated Hybrid Framework for Social Simulation

Learning, Fast and Slow: Towards LLMs That Adapt Continually

MEME: Multi-entity & Evolving Memory Evaluation

APWA: A Distributed Architecture for Parallelizable Agentic Workflows

harnessing_agentic_evolution

A Heterogeneous Temporal Memory Governance Framework for Long-Term LLM Persona Consistency

MemLineage: Lineage-Guided Enforcement for LLM Agent Memory

AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment

SkillGenBench: Benchmarking Skill Generation Pipelines for LLM Agents

A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents

Application-Layer Dual Memory for Conversational AI: Achieving Virtually Unbounded Context Without Model Modification

MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems

Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

From Model Scaling to System Scaling: Scaling the Harness in Agentic AI

CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning

MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation

Unlocking the Working Memory of Large Language Models for Latent Reasoning

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems

Stop Comparing LLM Agents Without Disclosing the Harness

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

LinTree: Improving LLM Reasoning with Explicitly Structured Search Histories

AGENTCL: Toward Rigorous Evaluation of Continual Learning in Language Agents

Agent Memory: Characterization and System Implications of Stateful Long-Horizon Workloads

Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories

Pretraining Recurrent Networks without Recurrence

AIP: A Graph Representation for Learning and Governing Agent Skills

HANDOFF: Humanoid Agentic Task-Space Whole-Body Control via Distilled Complementary Teachers

Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills

References

agent_instruction_protocol

backpropagation_through_time

beihang_university

chain_of_thought

conjugate_prompting

ford_motor_company

google_cloud_ai_research

graph_attention_network

habitat_simulator

harbin_institute_of_technology

harvard_university

infosys_limited

kl_distillation

knowledge_seeding

memory_agent_bench

microsoft_research_asia

mixture_of_experts

monte_carlo_tree_search

nanyang_technological_university

natural_questions

neural_architecture_search

northeastern_university

ohio_state_university

peng_cheng_laboratory

personalized_pagerank

princeton_university

purdue_university

rutgers_university

shanghai_ai_lab

stanford_iris_lab

stanford_university

supervised_memory_training

swebench_verified

terminalbench_2

tsinghua_fib_lab

tsinghua_university

tuebingen_ai_center

tulane_university

university_of_illinois_urbana_champaign

Topics

agent_architecture

artificial_super_intelligence

automotive_safety

autonomous_driving

catastrophic_forgetting

cognitive_science

context_engineering

continual_learning

contrastive_reflection

foundation_agents

knowledge_graph

language_philosophy

lineage_tracking

long_term_memory

memory_mechanism

multi_agent_systems

multi_hop_reasoning

persona_consistency

reasoning_memory

recurrent_neural_networks

reinforce_learning

reward_modeling

self_evolving_agents

semantic_navigation

sensor_calibration

social_simulation

symbolic_reasoning

test_time_scaling

trajectory_planning

video_understanding

workflow_optimization

Enter your search text in the box above

Select a result to preview

Enter to select

⇅ to navigate

ESC to close

Reward model design, RLHF, and reward signal engineering for reinforcement learning

表格 0 results

No results

Connected Pages

Pages mentioning this page

Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains

AMR-SD: Asymmetric Meta-Reflective Self-Distillation for Token-Level Credit Assignment

Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Powered by Forestry.md