Tom Veatch: Resume for Technical Consulting
in Speech Technology and related applications

Programming environments and Tools

Programming languages with completed projects: C, C++, Java, JavaScript, perl, awk, {ba,{{t}c}}sh, {my}sql. Pascal, LISP, BASIC, FORTRAN,
IDE's: emacs, MSVC++, MSVB, {SG,HT,X,VoiceX}ML.
Operating Systems: Un*xes including Solaris, several Linuxes, FreeBSD, WinXP, Win2000, Win98, WinNT, Win95, Win3.1, DOS 6.2
Hardware Platforms: Wintel and Unix PC's, Mac's, Sun/SGI workstations
Relevant API's: Microsoft Win32, MFC, SAPI, TAPI, MAPI; Novell SRAPI. Entropic's TrueTalk TTS API and HTK speech recognition API. Linkon Teravox and Device Driver Interface. Sun XTL telephony API. Some familiarity with messaging/networking protocols.
IVR systems: designed and implemented high-speed IVR application generator.
Voice Email: design and implementation currently in progress.
Lip Synch Machine: built the system.

Theory

Automatic speech recognition: TV is fully grounded in the fundamentals of HMM training and decoding; can teach it with or without the basic equations. He is a competent user of the HTK HMM Tool Kit, used by its authors to build the most accurate systems in the world for continuous, large vocabulary, speaker independent ASR (cf. ARPA Benchmark Tests). He is capable of building state-of-the-art HMM models, and effective recognition-based systems.
Text to speech synthesis: TV has deep scientific and broad engineering understanding of text to speech synthesis technology. he wrote the first type-and-talk TTS web page anywhere (Dec. 1994), for Entropic, using AT&T's TTS. (TV's thesis adviser, Mark Liberman, wrote the first version of AT&T's TTS.) TV is competent at optimizing synthesized speech parameters from intonation to phoneme transcriptions. He is an active consultant on large-scale synthesis-based telephone apps (references available on request). He is capable of porting tts engines to new platforms and APIs as well as using stable APIs in new applications. An ongoing development project uses TTS for reading email over the phone.
Speech science: TV's thesis was titled "English Vowels: Their Surface Phonology and Phonetic Implementation in Vernacular Dialects", and included chapters on acoustics, computational analysis of speech data, and a number of dialects (LA Chicano English, Alabama and Chicago White English, and Jamaican English Creole). TV taught phonetics, speech recognition, social/dialect variation and methodology at Stanford University in 1991-93. He has presented a large number of competitively reviewed as well as invited papers at the Acoustical Society of America, the Linguistic Society of America, the NWAVE sociolinguistics conference, etc. He has some skill in identifying the native geographical dialect of English speakers.
Other areas: TV has considerable background in:
- Electrical engineering: linear systems, FFT methods, stochastic signal processing, information theory, Hidden Markov models, &c.
- Computer science: several programming languages and methodologies, artificial intelligence, computational linguistics (recognition, synthesis, parsing), &c.
- Statistics: exploratory data analysis, hypothesis testing, a variety of uni- and multi-variate analysis and modeling techniques, statistical natural language processing, &c.
- Dialog models: discourse analysis (e.g. Polanyi's models)

Tom Veatch: Resume for Technical Consulting in Speech Technology and related applications

Programming environments and Tools

Theory

Copyright © 1996 Sprex, Inc. All rights reserved. Modified: November 22, 1996

Tom Veatch: Resume for Technical Consulting
in Speech Technology and related applications

Copyright © 1996 Sprex, Inc. All rights reserved.
Modified: November 22, 1996