ðŒ ç»åç·šéã®ãã¹ãæã¹ã±ãŒãªã³ã°ã¯ãã©ããªç·šéã«ãåãèšç®äºç®ããå²ãåœãŠãã¡ã§ãç¡é§ã ããã§ãããé£æåºŠã«å¿ããŠé
åããç·šéã«ç¹åããæ€èšŒã§æåãããããšã§ãå質ãä¿ã£ããŸãŸæå€§2.2åã®é«éåãå®çŸããç ç©¶ã§ãã
ã¿ã€ãã«: From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
URL:
ð æŠèŠ
ADE-CoTã¯ãç®çå¿åã®ç»åç·šéã«ç¹åãããã¹ãæã¹ã±ãŒãªã³ã°ææ³ã§ããããã¹ãããç»åãäœãçæåãã«äœãããåŸæ¥ã®Image-CoTããã®ãŸãŸç·šéã«æµçšããã®ã§ã¯ãªãããé£æåºŠã«å¿ããè³æºé
åããç·šéç¹åã®æ©ææ€èšŒããæ©äŒäž»çŸ©çãªåæ¢ããšãã3ã€ã®æŠç¥ãçµã¿åãããèšç®ã倧ããç¯çŽããªããå質ãç¶æããŸãã
â 解決ãã課é¡
åŸæ¥ææ³ã«ã¯3ã€ã®ãã¹ãããããããŸããã
ã»åºå®ã®ãµã³ããªã³ã°äºç®ããã»ãšãã©æ¹åããªãç°¡åãªç·šéã«ãèšç®ã浪費ãã
ã»æ±çšã®MLLMã¹ã³ã¢ããæ©æã¹ã³ã¢ã¯äœããŠãæçµçã«é«åŸç¹ã«ãªããµã³ãã«ã®çŽ40%ã誀ã£ãŠæåãããŠããŸã
ã»å€§èŠæš¡ãµã³ããªã³ã°ãåäžã®æ£è§£ãäœåºŠãçã¿ãäžèŠãªèšç®ãå¢ãã
ð¡ æ¹æ³è«ãšææ¡ææ³
ã»ç·šéã®é£æåºŠãèŠãŠãç°¡åãªç·šéã¯æå°äºç®ãè€éãªç·šéã¯æ¢çŽ¢ãæ¡å€§ããŸã
ã»ã¯ã³ã¹ãããã»ãã¬ãã¥ãŒã§ã远å ã®ããã€ãžã³ã°ãªãã«ãã€ãºäžéç¶æ
ããã¯ãªãŒã³ãªæœåšãæšå®ããæ©ææ€èšŒãä¿¡é Œã§ãããã®ã«ããŸã
ã»Grounded SAM2ã§ãæå³ããé åã ããå€ãã£ããããæ€èšŒããDINOv2ã®åã蟌ã¿ã§åé·ãªåè£ãé€å»ããŸã
ã»åè£ã鿬¡çæããæå³ã«åãçµæãååã«åŸãããæç¹ã§æã¡åãæ·±ãåªå
ã®åæ¢ã䜿ããŸã
ð¯ ãŠãŒã¹ã±ãŒã¹
è€éãªå§¿å¢å€æŽãè€æ°ãªããžã§ã¯ãã®åé€ã眮æã现ç²åºŠã®é åç·šéããã«ãã¿ãŒã³ã®é次線éããããŠèšç®å¶çŽäžã§ã®é«å質線éã«åããŸããæ¬çªã®ç»åç·šéAPIã®ããã«æšè«ã³ã¹ããå¹ãå Žé¢ã§ç¹ã«æå¹ã§ãã
ð å®éšçµæ
ã»GEdit-Benchã§ãFLUX.1 KontextãBest-of-Næ¯2.2åãBAGELã1.8åãStep1X-Editã2.0åã®é«éåãéæããŸãã
ã»æšè«å¹çã¯åºå®32ãµã³ãã«äºç®ã§2åè¶
ãçµæå¹çã¯3ã€ã®ãã³ãã§4.9åã»2.7åã»2.9åã«åäžããŸãã
ã»ãçœãæã®å¥³æ§ã®é£ã«ç«ã€äººãæ¶ãããšãã£ãé£ããè€æ°ãªããžã§ã¯ãç·šéã§ããããŒã¹ã©ã€ã³ã®èª€èªãæ£ãã解決ããŸãã
#
ImageEditing# #
DiffusionModels#