| |
My name is Hao Tang and today, I would like to introduce
myself. Through this month's tutorial, I'm going to introduce you to
my previous work, my current research interests and my vision for
ISIP.
I am currently a graduate student studying toward a Ph.D. degree in
the Department of Electrical and Computer Engineering at Mississippi
State University. I received my B.S. and M.S. degrees, both in
Electrical Engineering, from the University of Science and Technology
of China, in 1998 and 2003, respectively.
During my graduate study in China, I held a senior research position
at iFlyTEK, a leading Chinese speech technology provider. This
position provided me with plenty of exposure to the research. I have
led and participated in over fifteen projects at iFlyTEK and my
previous work made me win three scientific and technical awards from
the governments, of which one is the National Scientific and
Technical Progress Award conferred by the State Council of
P. R. China. In addition to these accomplishments, I have three
pending Chinese patents.
My past research fell mainly under the area of speech synthesis. I
have been a member of the team that developed a number of Chinese
Speech Synthesis Systems. You can evaluate the performance of these
speech synthesis systems, by clicking on the links given below. These
links point to the three synthetic wave files, of which the first one
was generated by our KBCT2.0 Cantonese TTS engine, the next one was
generated by our KBCE2.0 Chinese-English Mixed-lingual TTS engine, and
the last one was generated by our KD2000 Chinese TTS engine. (Note:
KBCT2.0, KBCE2.0 and KD2000 are trademarks of iFlyTEK.)
KBCT2.0 (Prototype) Apr. 2003 Wave file 1
KBCE2.0 (Product) Sep. 2002 Wave file 2
KD2000 (Product) Jun. 2000 Wave file 3
Apart from speech synthesis, I have also worked in the area of
distributed and embedded speech computing. I proposed an innovative
concept Distributed Speech Synthesis aimed at developing high
naturalness speech synthesis systems on resource-sensitive mobile and
embedded devices. I also developed several prototype systems under the
proposed architecture.
Why do I want to pursue a Ph.D. and why do I want to make a transition
from speech synthesis to speech recognition ? The answer to both these
questions is simple. My career goal is to become a faculty and direct
a research lab. In order to become a professor and conduct in-depth
research, a Ph.D. degree is a must. I believe that speech recognition
is one of the most fundamental and challenging field and thus deserves
much more research efforts than speech synthesis.
While working in ISIP, I will have ample of opportunities to get
involved in speech recognition research. My experience in industry has
helped me develop my own style of learning. That is, to learn and
master knowledge from practical work. ISIP provides an environment
that emphasizes practical work and that reflects my style of learning.
My first project in ISIP is to develop a speaker verification
system. Speaker verification is indeed a perfect start for me because
the underlying theory of speaker verification is conceptually similar
to speech recognition. I am sure the experience and knowledge I gain
from this project will definitely benefit my future work on the more
complicated and challenging speech recognition problems.
|
| |
|