PyoSignal Logo
PyoSignal
Back to Community
πŸ€– Reddit r/MachineLearning

Bulding my own Diffusion Language Model from scratch was easier than I thought [P]

35 upvotes 5 comments Read on Reddit
DiffusionModel AI LanguageModel MachineLearning

πŸ“ AI Summary

ν•œ μ‚¬μš©μžκ°€ AI μ½”λ“œ 도움 없이 직접 Diffusion Language Model을 κ΅¬ν˜„ν•˜λŠ” ν”„λ‘œμ νŠΈλ₯Ό μ§„ν–‰ν–ˆμœΌλ©°, Karpathy의 μž‘μ€ Shakespeare λ°μ΄ν„°μ…‹μœΌλ‘œ MacBook Air M2μ—μ„œ λͺ‡ μ‹œκ°„ λ™μ•ˆ ν•™μŠ΅μ‹œν‚¨ κ²°κ³Όλ₯Ό κ³΅μœ ν–ˆμŠ΅λ‹ˆλ‹€. ν”„λ‘œμ νŠΈλ₯Ό 톡해 Diffusion λͺ¨λΈμ˜ κΈ°λ³Έ κ°œλ…μ„ μ΄ν•΄ν•˜λŠ” 데 도움이 λ˜μ—ˆμœΌλ©°, λ‹€λ₯Έ μ‚¬μš©μžλ“€λ„ λΉ„μŠ·ν•œ ν”„λ‘œμ νŠΈμ— 관심을 λ³΄μ˜€μŠ΅λ‹ˆλ‹€.

πŸ”‘ Key Discussion Points

  • β€’ AI μ½”λ“œ 도움 없이 Diffusion Language Model을 μ²˜μŒλΆ€ν„° κ΅¬ν˜„ν•˜λŠ” ν”„λ‘œμ νŠΈλ₯Ό 진행함.
  • β€’ 7.5M νŒŒλΌλ―Έν„° λͺ¨λΈμ„ Karpathy의 Shakespeare λ°μ΄ν„°μ…‹μœΌλ‘œ MacBook Air M2μ—μ„œ λͺ‡ μ‹œκ°„ λ™μ•ˆ ν•™μŠ΅μ‹œν‚΄.
  • β€’ ν”„λ‘œμ νŠΈλ₯Ό 톡해 (discrete) diffusion, encoder, decoder, tokenizer와 같은 κ°œλ…μ„ μ΄ν•΄ν•˜λŠ” 데 도움이 λ˜μ—ˆλ‹€κ³  언급함.