I am Thanh Tran, an AI Resident at FPT Software AI Center, under the supervision of Dr. Van Nguyen and Prof. Son Hy. Before that, I obtained my B.S. degree in Computer Science from University of Engineering and Technology, Vietnam National University. I used to intern at VinBigData, working on Automatic Speech Recognition.

My research interests focus on audio and speech synthesis, particularly their generation from other modalities like visual data and text. I also have experience in AI for Science, with three papers and one workshop published in the field of protein design.

My attached CV (last updated: 2025 Jun).

🔥 News

2025.05: 🎉 One paper is accepted at Interspeech 2025!
2025.03: 🎉 One paper is accepted at Machine Learning: Science and Technology!
2024.12: 🎉 One paper is accepted at ICASSP 2025!
2024.11: 🎉 One paper is accepted at KDD 2025!
2024.10: 🎉 One paper is accepted at Machine Learning in Structural Biology (MLSB) Workshop in NeurIPS 2024!
2024.08: 🎉 One paper is accepted at IEEE Transactions on Evolutionary Computation!
2023.08: I join FPT Software AI Center as an AI Resident in Vietnam!
2021.10: I join VinBigData as a speech research intern in Vietnam!

📝 Publications

ICASSP 2025

Effective Context Modeling Framework for Emotion Recognition in Conversations

Cuong Tran Van*, Thanh V. T. Tran*, Van Nguyen, Truong Son Hy

[Paper] [Code] [Poster]

We design a GNN framework modeling both multiscale and multivariate interactions among modalities and utterances within conversations.
We address class imbalance with a re-weighting scheme in the loss function.

KDD 2025

GROOT: Effective Design of Biological Sequences with Limited Experimental Data

Thanh V. T. Tran*, Nhat Khang Ngo*, Viet Anh Nguyen, Truong Son Hy

[Paper] [Code] [Poster]

We introduce a novel framework using graph-based smoothing to train a surrogate model, which is then used in the optimization process.
We theoretically and empirically show that our technique can expand into extrapolation regions while keeping a reasonable distance from the training data.
Our method can be applied on diverse tasks of different domains.

RESOUND: Speech Reconstruction from Silent Videos via Acoustic-Semantic Decomposed Modeling, Long-Khanh Pham, Thanh V. T. Tran, Minh-Tan Pham, Van Nguyen | [Interspeech 2025] [Project page]
LatentDE: Latent-based Directed Evolution accelerated by Gradient Ascent for Protein Sequence Design, Thanh V. T. Tran, Nhat Khang Ngo, Viet Thanh Duy Nguyen, Truong Son Hy | [Machine Learning: Science and Technology] [Code]
Protein Design by Directed Evolution guided by Large Language Models, Thanh V. T. Tran, Truong Son Hy | [IEEE Transactions on Evolutionary Computation] [Code]

🎖 Honors and Awards

University of Engineering and Technology’s Scholarship
Second prize in ASR Task 1, VLSP 2022 (Certificate)
Third prize in ASR Task 2, VLSP 2022 (Certificate)
Third prize in Scientific Research Contest, University of Engineering and Technology (Certificate)

📖 Educations

2019.08 - 2023.06, Undergraduate Student, University of Engineering and Technology (UET), Vietnam National University

💻 Industry Experience

2023.08 - now, AI Resident, FPT Software AI Center, Hanoi, Vietnam
2021.10 - 2022.11, Spoken Language Processing Department, VinBigData, Hanoi, Vietnam.

Thanh Van Trong Tran

🔥 News

📝 Publications

🎖 Honors and Awards

📖 Educations

💻 Industry Experience