I recently started my PhD in Computer Science at Tsinghua University. I am conducting research at the Center for Speech and Language Technologies (CSLT) under the supervision of Prof. Thomas Fang Zheng. My current work focuses on Visual Speech Recognition, and my broader interests span speech and language technologies, including ASR, multimodal speech understanding, and NLP.

News

December 21, 2024

🎉Paper accepted at ICASSP 2025 Conference

Our research paper 'Sagalee: An Open Source Automatic Speech Recognition Dataset for Oromo Langauge' has been accepted for presentation at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025)

June 25, 2023

【毕业季】学校举行2023届优秀毕业生代表座谈会

School held the 2023 Symposium of Outstanding Graduate Representatives

December 6, 2022

Three International Students Awarded the Title: 2022 Excellent International Students of UESTC

Recipient of the 2022 Excellent International Students of UESTC award

Education

2025—Present

Tsinghua University

PhD in Computer Science

2023—2025

Tsinghua University

MSc in Computer Science

2019—2023

University of Electronic Science and Technology of China

BSc in Software Engineering

Publications

Sagalee: An Open Source Automatic Speech Recognitioin Dataset for Oromo language

ICASSP 2025

Sagalee: An Open Source Automatic Speech Recognitioin Dataset for Oromo language

Turi Abu, Shi Ying, Thomas Fang Zheng, Dong Wang

International Journal of Machine Learning and Cybernetics 2024

Unified Deep Learning Model for Multitask Representation and Transfer Learning: Image Classification, Object Detection, and Image Captioning

LY Bayisa, W Wang, Q Wang, CC Ukwuoma, Gutem HK, Endris A and Turi Abu

Journal of Public Health and Environmental Technology 2023

Attention-Based End-to-End Hybrid Ensemble Model for Breast Cancer Multi-Classification

CC Ukwuoma, D Cai, ES gATI, VK Agbesi, G Deribachew, LY Bayisa, and Turi Abu

Experience

2026-present

PhD Researcher CSLT, Tsinghua University

Advisor: Prof. Dong Wang, Prof. Thomas Fang Zheng

Working on Visual Speech Recognition at CSLT.

  • Conducting PhD research in speech and language technologies.
  • Exploring multimodal approaches for robust speech understanding.
Present

Building HayuLabs

Building hayulabs.com, speech AI and AI-assisted text editor for low resource langauges

Previously

Founding Engineer Qilingo

Founding engineer at Qilingo, worked GoAssistant customized assistant with LLM tool use.

October 2022 - March 2023

Intern-AI Engineer HCYtech

Joined the AI department. Key responsibilities included:

  • Developed a smoking detection system using YOLO from surveillance camera feeds.
  • Built a custom chatbot with the OpenAI API and speech-to-text using Whisper.

Portfolio

Mobile Application for Speech Data Collection

Mobile Application for Speech Data Collection

FlutterFirebaseAndroid
  • Developed android application for Automatic Speech Recognition data colllection using Flutter and firebase
Local Retrieval Augmented Generation (RAG) Using Ollama

Local Retrieval Augmented Generation (RAG) Using Ollama

DockerOllamaFlaskRAG
  • Built a local RAG using open source llama model for question answering task
Finetuning LLM for Grammatical Error Correction (GEC)

Finetuning LLM for Grammatical Error Correction (GEC)

PythonPyTorchFlaskReactLLM
  • Fine-tuned T5 Flan-based model on JFLEG dataset for GEC task, and built Grammarly like website for demo
CodeDig - Semantic Code Search Engine

CodeDig - Semantic Code Search Engine

PythonLLMflaskDocument indexing
  • Built a semantic code search engine using ChatGLM API, whoosh library, and Flask.
Distributed Database Management System

Distributed Database Management System

PythonMinIOMySQLFlask
  • Design and implementation of Distributed database system.