Phong (Felix) Do
Hello and welcome! I'm Phong Do, a first-year PhD student at the University of Warwick, UK, supervised by Dr. Gabriele Pergola. My research focuses on Retrieval-Augmented Generation (RAG) and Knowledge Graphs to advance document retrieval and improve NLP tasks such as question answering and machine reading comprehension.
My journey into AI began at the NLP@UIT lab at the University of Information Technology in Ho Chi Minh City, where I specialized in question answering and machine reading comprehension as an undergraduate researcher. This early work laid the foundation for my long-term interest in building systems that can read, understand, and reason over text. After graduation, I stayed involved with the lab and expanded my experience at Zalo AI, where I spent over three years working on large language models and speech processing. At Zalo, I contributed to speech technologies for Kiki Assistant, a Vietnamese virtual voice assistant installed in over one million cars, helping bring conversational AI into everyday life.
Beyond traditional NLP, I am also pursuing research in GeoAI, the intersection of geography and artificial intelligence. My interest lies in how spatial knowledge and geographic context can be modeled to improve information retrieval and reasoning, particularly in combination with RAG and Knowledge Graphs. By integrating linguistic and spatial perspectives, I aim to develop AI systems that not only understand language but also situate that understanding within the broader context of knowledge, culture, and place.
Research Interests
I'm passionate about building intelligent retrieval systems that can reason, generalize across modalities and languages, and interact with structured knowledge. My research focuses on two major directions: enhancing Retrieval-Augmented Generation (RAG), and exploring the intersection between Artificial Intelligence and Geography.
Retrieval-Augmented Generation: I'm interested in advancing RAG systems to make retrieval more intelligent, interpretable, and adaptable to complex data types.
Knowledge Graph Reasoning: Leveraging structured knowledge graphs to support logical reasoning and evidence chaining during retrieval.
Multilingual & Multimodal Embedding: Designing unified embedding spaces for cross-language and cross-modal semantic search - enabling systems to search across text, images, and more.
AI and Geography: As a geography enthusiast, exploring the intersection between spatial intelligence and artificial intelligence.
- Spatial RAG & Geographic Question Answering: Developing systems that understand and retrieve geospatial information, enabling models to answer geography-based questions and reason over maps, locations, and spatial relationships.
I am happy to chat and discuss potential collaborations. Please feel free to reach out to me via email (phongdntvn@gmail.com).
News
June 25, 2025 - I will be starting my PhD at the University of Warwick!
I am thrilled to share that I have been awarded a fully-funded scholarship for the PhD in Computer Science program at the University of Warwick, UK. I will be starting this exciting journey in Fall 2025.
May 16, 2025 - A paper accepted to ACL 2025 (Main)
Our new paper from Zalo, "VMLU Benchmarks: A comprehensive benchmark toolkit for Vietnamese LLMs", has been accepted at ACL 2025.
June 19, 2024 - My paper published at NAACL 2024 (Findings)
Our new paper from the UIT NLP Group, "VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding", has been published at NAACL 2024.
