PyoSignal

Agentic Abstention: Do Agents Know When to Stop Instead of Act?

Paper ID: 2606.28733 • 115 Upvotes
LLM Agent • Decision Making • Prompt Engineering • Cost Optimization • Agent Reasoning • Evaluation • Distillation

📝 Key Summary

The paper studies whether agents can stop pointless iteration on impossible or ambiguous tasks and abstain at the right moment, and it proposes a method for improving that ability.

📖 Details

LLM agents pursue goals through multi-turn interaction with complex environments, but they often keep calling tools even when the goal is impossible or ambiguous. This paper defines the 'Agentic Abstention' problem: deciding, under uncertainty, when an agent should stop acting. The authors ran large-scale experiments on 13 agent systems across environments including web shopping, terminal use, and question answering. The results show that a larger model or stronger reasoning ability does not guarantee better timely abstention and can in fact make it worse. To address this, the paper proposes CONVOLVE, a context engineering method that distills interaction trajectories into reusable abstention rules. CONVOLVE substantially improved agents' timely abstention without updating any model parameters.
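To make the idea of prompt-level abstention rules concrete, here is a minimal sketch of an agent loop that prepends reusable stopping rules to the task context and lets the policy emit a stop action. This is not the paper's CONVOLVE implementation: the `ABSTAIN` sentinel, `build_prompt`, `run_episode`, and `toy_policy` are hypothetical names chosen for illustration, and the policy is a toy stand-in for an LLM call.

```python
from dataclasses import dataclass
from typing import Callable, List

ABSTAIN = "ABSTAIN"  # hypothetical sentinel action, not taken from the paper


@dataclass
class EpisodeResult:
    outcome: str      # "done", "abstained", or "budget_exhausted"
    steps_used: int


def build_prompt(task: str, rules: List[str]) -> str:
    """Prepend reusable stopping rules to the task description."""
    rule_text = "\n".join(f"- {r}" for r in rules)
    return f"Stopping rules:\n{rule_text}\n\nTask: {task}"


def run_episode(
    policy: Callable[[str, List[str]], str],
    task: str,
    rules: List[str],
    max_steps: int = 10,
) -> EpisodeResult:
    """Run the policy until it finishes, abstains, or exhausts the step budget."""
    prompt = build_prompt(task, rules)
    history: List[str] = []
    for step in range(1, max_steps + 1):
        action = policy(prompt, history)
        if action == ABSTAIN:
            return EpisodeResult("abstained", step)
        if action == "DONE":
            return EpisodeResult("done", step)
        history.append(action)
    return EpisodeResult("budget_exhausted", max_steps)


def toy_policy(prompt: str, history: List[str]) -> str:
    """Toy stand-in for an LLM: repeats one search, and abstains only when
    a stopping rule is present and the last two actions were identical."""
    has_rule = "Stopping rules:\n-" in prompt
    if has_rule and len(history) >= 2 and history[-1] == history[-2]:
        return ABSTAIN
    return "search('blue widget')"
```

With a rule such as "Stop after repeated identical tool calls." in the context, the toy policy abstains on its third step; with an empty rule list it runs until the step budget is exhausted. Because the rules live in the prompt, changing them requires no update to model parameters, which matches the property the summary attributes to CONVOLVE.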

🔑 Key Points

  • Defines 'Agentic Abstention' as a sequential decision-making problem for agents
  • Analyzes the mixed effects of model scale and scaffolding on timely abstention
  • Proposes 'CONVOLVE', a context engineering technique based on trajectory distillation

💡 Practical Relevance

Offers practical guidelines for keeping agents from falling into infinite loops or wasting money on unnecessary API calls.

✅ Recommended Actions

  • Analyze the logs of agents currently in production for their stopping points, and measure the number of unnecessary iterations
  • Build a test set that includes scenarios where failure is clear-cut, and evaluate the agent's abstention performance on it
  • Apply a context engineering technique such as CONVOLVE to experiment with prompt-based stopping rules
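The first two actions above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the paper's evaluation protocol: it treats an exact repeat of an earlier action as a proxy for an unnecessary iteration, and it assumes each test episode is a dict with hypothetical boolean fields `solvable` and `abstained`.

```python
from typing import Dict, List


def wasted_steps(actions: List[str]) -> int:
    """Count steps that repeat an action already seen in the same episode.
    Assumption: an exact repeat is a proxy for an unnecessary iteration."""
    seen = set()
    wasted = 0
    for action in actions:
        if action in seen:
            wasted += 1
        else:
            seen.add(action)
    return wasted


def abstention_scores(episodes: List[Dict[str, bool]]) -> Dict[str, float]:
    """Precision and recall of abstaining, where a correct abstention is
    stopping on a task labeled unsolvable (hypothetical label scheme)."""
    tp = sum(1 for e in episodes if e["abstained"] and not e["solvable"])
    fp = sum(1 for e in episodes if e["abstained"] and e["solvable"])
    fn = sum(1 for e in episodes if not e["abstained"] and not e["solvable"])
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return {"precision": precision, "recall": recall}
```

Low recall here means the agent keeps acting on tasks it should have given up on; low precision means it gives up on tasks it could have solved. Tracking both guards against a stopping rule that simply makes the agent quit more often.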