GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

Paper ID: 2606.18829 • 13 Upvotes
LLM-Agent Memory-Governance Security RAG Agent Benchmark Evaluation
📝 Key Summary

Proposes a benchmark centered on governance (access control and deletion) for shared-memory agents in multi-user environments.

📖 Details

Existing memory benchmarks for LLM agents focus on single-user settings and overlook memory management in shared multi-user environments such as hospitals and offices. This paper introduces the GateMem benchmark to evaluate situations in which multiple users draw on a common memory pool and access it according to their own roles and permissions. GateMem jointly evaluates three capabilities: the utility of the information (Utility), access control along permission boundaries (Access Control), and forgetting in response to explicit deletion requests (Forgetting). Testing models on scenarios from domains including healthcare, office work, and education shows that current agents struggle to balance utility against security. RAG-based approaches in particular are low in cost but were found to leak information to users without permission and to expose information that had been deleted.
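
The two failure modes described above can be illustrated with a toy sketch. This is not GateMem's code or data: the `MemoryItem` fields, the `naive_retrieve` function, and the sample records are all hypothetical, and keyword overlap stands in for a real retriever. It only shows how a shared pool queried without permission or deletion checks returns everything that matches.

```python
from dataclasses import dataclass


@dataclass
class MemoryItem:
    owner: str             # principal who wrote the item (hypothetical field)
    text: str
    deleted: bool = False  # soft-delete flag; the naive retriever ignores it


def naive_retrieve(pool, query):
    """Return every item sharing a word with the query.

    No check is made on who is asking or on whether the item was deleted.
    """
    words = set(query.lower().split())
    return [m for m in pool if words & set(m.text.lower().split())]


# Hypothetical shared pool: one live record and one that a user asked to delete.
pool = [
    MemoryItem("dr_kim", "patient A allergy penicillin"),
    MemoryItem("nurse_lee", "patient B discharge scheduled friday", deleted=True),
]

# Any requester, whatever their role, gets both records back.
hits = naive_retrieve(pool, "patient allergy")
leaked_deleted = [m.text for m in hits if m.deleted]
```

In this sketch the requester's identity never enters the retrieval call, so there is nothing for an access policy to act on, and the soft-deleted record remains retrievable.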

🔑 Key Points

  • Develops 'GateMem', a shared-memory governance benchmark that reflects multi-user (multi-principal) environments
  • Evaluates three core metrics together: information utility, access control (Access Control), and active forgetting (Active Forgetting)
  • Demonstrates that existing RAG and external-memory approaches are weak at security and at honoring deletion requests

💡 Practical Value (Relevance)

For teams building collaboration tools or shared assistant agents, the paper shows how hard per-user permission management and personal-data deletion are in practice once a system goes beyond simple information retrieval (RAG).

✅ Recommended Actions (Actionable Items)

  • When designing agents, experiment with a memory layer that applies a permission hierarchy (Role-based Access Control) rather than plain RAG alone
  • Build test cases that verify data is completely removed from the vector DB and caches when a user issues a 'deletion request'
  • Analyze the security vs. cost-efficiency trade-off between long-context prompting and RAG
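
The first two actions can be prototyped together. What follows is a minimal sketch under stated assumptions, not a method from the paper: `Record`, `GovernedMemory`, the role names, and the sample data are all hypothetical, an in-memory dict stands in for a vector DB, and keyword overlap stands in for embedding search. The role filter is applied at retrieval time, and `forget` removes the record and invalidates the cache so that a stale cached result cannot resurface deleted data.

```python
from dataclasses import dataclass, field


@dataclass
class Record:
    record_id: str
    text: str
    allowed_roles: frozenset  # roles permitted to read this record (assumption)


@dataclass
class GovernedMemory:
    records: dict = field(default_factory=dict)  # stand-in for a vector DB
    cache: dict = field(default_factory=dict)    # (role, query) -> record ids

    def write(self, record):
        self.records[record.record_id] = record
        self.cache.clear()  # new data can change any cached result

    def search(self, role, query):
        key = (role, query.lower())
        if key not in self.cache:
            words = set(query.lower().split())
            # Filter by role before matching, so unauthorized text is never returned.
            self.cache[key] = [
                r.record_id
                for r in self.records.values()
                if role in r.allowed_roles and words & set(r.text.lower().split())
            ]
        return [self.records[i].text for i in self.cache[key]]

    def forget(self, record_id):
        self.records.pop(record_id, None)
        self.cache.clear()  # drop cached results that may reference the record


mem = GovernedMemory()
mem.write(Record("r1", "patient A allergy penicillin", frozenset({"doctor"})))
mem.write(Record("r2", "cafeteria menu friday", frozenset({"doctor", "receptionist"})))

doctor_before = mem.search("doctor", "patient allergy")          # authorized read
receptionist_view = mem.search("receptionist", "patient allergy")  # blocked by role
mem.forget("r1")
doctor_after = mem.search("doctor", "patient allergy")           # gone after deletion
```

A deletion test in this style should query after `forget` using a query that was already cached beforehand, as the `doctor` query is here, since that path is the one most likely to expose leftover data in a real cache.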