KIDRS

Title (Korean): 3D 디지털 휴먼의 자연스러운 한국어 발화를 위한 AI 학습용 데이터세트 구축 사례
Title (English): A Case Study on the Construction of an AI Training Dataset for Natural Korean Speech Generation by 3D Digital Humans
Authors: 이솔, 이학범, 김진겸, 오문석, 서영호
Abstract
3D digital humans (also called virtual humans) are increasingly being adopted in the production of broadcast and video content. Because they are controllable and can be used without temporal or spatial constraints, digital humans are emerging as an effective way to improve production efficiency. Digital human production is shifting from traditional 2D formats to interactive 3D forms, and various datasets for creating 3D digital humans are accordingly being released on public data platforms such as AI Hub. However, the facial data needed to generate speech-driven 3D facial animation that realistically expresses Korean speech is currently lacking. This study addresses that gap by constructing and releasing a dataset designed specifically for producing 3D digital humans capable of natural Korean speech. In doing so, we aim to reduce production costs through the application of generative AI and to contribute to the advancement of related technologies and the growth of the associated industry ecosystem. The research also aligns with the national AI policy initiative "AI in Everyday Life," corresponds with recent R&D priorities combining AI and the metaverse, and is expected to be applicable to the future development of the Digital Platform Government.