ð§ ãã³ã³ããã¹ããèªãããããã³ã³ããã¹ãããã¹ãã«ã身ã«ã€ããããžã人æã®æ³šéãå€éšãã£ãŒãããã¯ã䜿ãããèªå·±å¯ŸæŠã ãã§LLMãæèåºæã®ã¹ãã«ãç²åŸããææ³ã§ãã
ã¿ã€ãã«: From Context to Skills: Can Language Models Learn from Context Skillfully?
URL:
ð æŠèŠ
LLMã¯äºååŠç¿ã«ããç¥èã¯åŸæã§ãããæ°èŠã§å°éçãªæèã«ã¯åŒ±ãã§ããæ¬è«æã¯ãäººææ³šéãå€éšãã£ãŒãããã¯ãªãã«ãæèåºæã®ã¹ãã«ãèªåŸçã«çºèŠã»æŽç·ŽããCtx2Skillãææ¡ããŸãã
â 解決ãã課é¡
é·ãå°éçãªææžã§ã¯ã¢ãããŒã·ã§ã³ã®ã³ã¹ããé«ãããŸããããã«ã³ãŒãã£ã³ã°ãšéããã³ã³ããã¹ãåŠç¿ã«ã¯å®è¡ãã£ãŒãããã¯ã®ãããªæ€èšŒä¿¡å·ããªããèªåçãªã¹ãã«æ§ç¯ãå°é£ã§ããã
ð¡ æ¹æ³è«ãšææ¡ææ³
ã»åçµããLMã«ãã5圹å²ã®ãã«ããšãŒãžã§ã³ãèªå·±å¯ŸæŠããM=5ã¿ã¹ã¯ã«N=5åå埩ããŸã
ã»Challengerã匱ç¹ãçªãã¿ã¹ã¯ãšã«ãŒããªãã¯ãäœããReasonerãè§£ããJudgeãååŠãå€å®ããŸã
ã»ProposerãšGeneratorã倱æã蚺æããŠã¹ãã«æŽæ°ãåæããŸã
ã»Cross-Time Replayã§ãé£åãšæåã®æ§èœã®ç©ãæå€§åããå埩ããŸããã§æãæ±åããã¹ãã«ã»ãããéžã³ãŸã
ð¯ ãŠãŒã¹ã±ãŒã¹
å°éé åã®é·æããã¥ã¡ã³ããäžããŠããã®å Žã§ã¢ãã«ã«å¿
èŠãªã¹ãã«ãç²åŸãããçšéã«åããŸãããã¡ã€ã³åºæã®ç¥èãžçŽ æ©ãé©å¿ããããå®åã«çŽçµããŸãã
ð å®éšçµæ
ã»CL-BenchïŒ500ã³ã³ããã¹ãã1,899ã¿ã¹ã¯ïŒã§ãGPT-4.1ã®è§£ççã11.1%â16.5%ã«åäž
ã»GPT-5.1ã¯21.1%â25.8%ãGPT-5.2ã¯18.2%â21.4%
ã»åŒ·ãã¢ãã«ã®ã¹ãã«ã匱ãã¢ãã«ãžè»¢ç§»ããGPT-5.1ã®ã¹ãã«ãGPT-4.1ã«é©çšãããš16.1%
ã»é©çšåŸã®GPT-4.1ïŒ16.5%ïŒã¯ãæ¡åŒµãªãã®Gemini 3 ProïŒ15.8%ïŒãäžåããŸãã
#
LLM# #
InContextLearning#