[NAVER Cloud] Model Driven Multimodal LLM Curation (°æ·Â)
´ã´ç¾÷¹« Model Driven Vision DATA Curation • Vision Language Model »ý»ê Àüü ´Ü°è¿¡ À̸£´Â ÇнÀ ¹× Æò°¡ µ¥ÀÌÅÍ ¼³°è • µ¥ÀÌÅÍ Ç°Áú Çâ»óÀ» À§ÇÑ ¸ðµ¨ ±â¹ÝÀÇ ÇнÀ µ¥ÀÌÅÍ Assessment ¹× Filtering ¿¡ ´ëÇÑ ¹æ¹ý·Ð ޱ¸ • ÃÖÀûÀÇ Recipe Ž»öÀ» À§ÇÑ Curation ¹æ¹ý·Ð °³¹ß ¹× ¸ðµ¨ ÇнÀ • ±¤¹üÀ§ÇÑ Domain & TaskÀÇ Dataset¿¡ ´ëÇÏ¿© ¼­·ÎÀÇ ¿µÇâµµ ¹× ÃÖÁ¾ ¸ðµ¨ ¼º´É¿¡ ¹ÌÄ¡´Â ¿µÇâ Ž±¸ • ¹®Á¦ Ç®ÀÌ ¹× Reasoning ¿µ¿ªÀ» Æ÷ÇÔÇÑ Æ¯È­ µ¥ÀÌÅÍ È®º¸ ¹× ÃÖÁ¾ ¸ðµ¨ ¼º´É ¿µÇâ ÁõÁø • Foundation ¸ðµ¨ °³¹ßÀ» À§ÇÑ ´ë±Ô¸ð Pretraining µ¥ÀÌÅÍ ¼³°è • Reasoning ¼º´É Çâ»óÀ» À§ÇÑ RLVR Reward ¹× °ü·Ã µ¥ÀÌÅÍ ¼³°è ÀÚ°Ý¿ä°Ç