Kento Sasaki
I am a graduate student in Informatics at the University of Tsukuba, affiliated with the Communication Understanding Laboratory. My current research focuses on the applications of Vision Language Action Models for autonomous driving.
Education
- Master of Science in Informatics, University of Tsukuba (2023 - present)
- Bachelor of Arts in Library and Information Science, University of Tsukuba (2021 - 2023)
- Associate Degree in Electronic Control System Engineering, National Institute of Technology (KOSEN), Numazu College (2015 - 2020)
Work Experience
- Turing Inc., Research Engineer (April 2023 - present)
- Turing Inc., Internship (June 2022 - March 2023)
- National Institute for Materials Science, Technical Staff (December 2021 - June 2022)
- National Institute for Materials Science, Research Internship (August 2021 - September 2021)
Publications
Peer Reviewed Journals
- Kento Sasaki, Yohei Seki. Exploration of Commentary Generation Methods Considering the Components of Shogi Commentary Texts. DBSJ Journal Data-Driven Studies, Vol. 2, Article No. 3, 2024.
International Conferences
- Hidehisa Arai*, Keita Miwa*, Kento Sasaki*, Yu Yamaguchi, Kohei Watanabe, Shunsuke Aoki, Issei Yamamoto. CoVLA: Comprehensive Vision-Language Action Dataset for Autonomous Driving. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025. [arXiv]
- Yuichi Inoue*, Kento Sasaki*, Yuma Ochi, Kazuki Fujii, Kotaro Tanahashi, Yu Yamaguchi. Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), The 3rd Workshop on Computer Vision in the Wild, 2024. [arXiv]
Domestic Conferences
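- Keita Miwa*, Hidehisa Arai*, Kento Sasaki*, Kohei Watanabe, Yu Yamaguchi. Construction of an Integrated Language, Vision, and Action Dataset for Autonomous Driving. The 19th YANS Symposium, 2024, S5-P04. (in Japanese)
- Kento Sasaki*, Yuichi Inoue*, Kazuki Fujii, Kotaro Tanahashi, Yu Yamaguchi. Construction of a Japanese Vision Language Model Using Large Language Models and a Proposed Evaluation Method. The 27th Meeting on Image Recognition and Understanding (MIRU), 2024, OS-2A-01. (in Japanese)
- Kento Sasaki, Yohei Seki. Exploration of Commentary Generation Methods Considering the Components of Shogi Commentary Texts. The 15th Forum on Data Engineering and Information Management (DEIM), 2023, 1a-7-5. (in Japanese)
- Kento Sasaki, Yohei Seki. Definition and Classification of the Components of Shogi Commentary Texts. ARG 18th Workshop on Web Intelligence and Interaction (WI2), 2022, pp. 75-78. (in Japanese)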
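- Kento Sasaki, 山路倍弘, 橋本敬之, 北本朝展, 鈴木静男. A Preliminary Study of Character Recognition Using Deep Learning for Historical Documents in the Izu Region. Theory and Applications of GIS, 2019, Vol. 27, No. 2, p. 159(93). (in Japanese)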
arXiv
- Hidehisa Arai*, Keita Miwa*, Kento Sasaki*, Yu Yamaguchi, Kohei Watanabe, Shunsuke Aoki, Issei Yamamoto. CoVLA: Comprehensive Vision-Language Action Dataset for Autonomous Driving. arXiv preprint arXiv:2408.10845, 2024. [arXiv]
- Yuichi Inoue*, Kento Sasaki*, Yuma Ochi, Kazuki Fujii, Kotaro Tanahashi, Yu Yamaguchi. Heron-Bench: A Benchmark for Evaluating Vision Language Models in Japanese. arXiv preprint arXiv:2404.07824, 2024. [arXiv]
Awards and Honors
- YANS 2024 Encouragement Award (co-author)
- MIRU 2024 Student Encouragement Award
- University of Tsukuba Alumni Association Ezaki Award 2023
- DEIM 2023 Excellent Interactive Award
- DEIM 2023 Sponsor Award (LayerX Inc.)
- ARG 18th Workshop on WI2 Excellent Research Award
- 28th GISA Conference Poster Session Award
- Suzuki Education & Culture Foundation Scholarship
Talks
- Cultural Lecture 2023, National Institute of Technology (KOSEN), Numazu College (October 2023)