Speech Recognition and Processing
Tracer Bullet 94-6
This guide reviews the literature in the collections of the Library
of Congress on speech recognition. Speech recognition is a process
by which the elements of spoken language can be recognized and
analyzed, and the linguistic message it contains transposed into
a meaningful form so that a machine can respond correctly to spoken
commands. The earliest attempts to devise systems for automatic
speech recognition by machine were made in the 1950s. Today, speech
recognition research is interdisciplinary, drawing upon work in
fields as diverse as biology, computer science, electrical engineering,
linguistics, mathematics, physics, and psychology. Within these
disciplines, pertinent work is being done in the areas of acoustics,
artificial intelligence, computer algorithms, information theory,
linear algebra, linear system theory, pattern recognition, phonetics,
physiology, probability theory, signal processing, and syntactic
theory.
Applications of speech recognition have been made in office or
business systems, as well as in manufacturing, medicine, and telecommunications,
and usually concern the recognition and retrieval of information,
such as voice-activated data entry; the control, operation, and
monitoring of various machines and devices; call-processing functions;
and the automation of services normally requiring human beings.
While speech recognition has many short-term applications, it
also has the potential to change daily life profoundly as free
communication between man and machine becomes a reality.
Not intended as a comprehensive bibliography, this guide is designed--as
the name of the series implies--to put the reader "on target."
Lea, Wayne A. Speech recognition. In McGraw-Hill encyclopedia
of science & technology. v. 17. New York, McGraw-Hill,
c1987. p. 231-237.
Q121.M3 1987 <SciRR>
Roe, David B., and Jay G. Wilpon. Whither speech recognition: the
next 25 years. IEEE communications magazine, v. 31, Nov.
1993: 54-62.
TK5101.A1I13a and Pamphlet box <SciRR>
Rudnicky, Alexander I., Alexander G. Hauptmann, and Kai-Fu Lee.
Survey of current speech technology. Communications of the ACM,
v. 37, Mar. 1994: 52-57.
QA76.A772 and Pamphlet box <SciRR>
Subject headings used by the Library of Congress, under which books
on speech recognition can be located in most card, book, and online
catalogs, include the following:
Highly Relevant
AUTOMATIC SPEECH RECOGNITION
SPEECH PROCESSING SYSTEMS
Relevant
SPEECH PROCESSING SYSTEMS INDUSTRY
SPEECH SYNTHESIS
Related
MARKOV PROCESSES
More General
COMPUTATIONAL LINGUISTICS
COMPUTER INPUT-OUTPUT EQUIPMENT
PATTERN RECOGNITION SYSTEMS
PERCEPTION
SPEECH
SPEECH PERCEPTION
VOICE
Ainsworth, W. A. Speech recognition by machine. London,
P. Peregrinus on behalf of the Institution of Electrical Engineers,
c1988. 206 p. (IEE computing series, 12)
Bibliography: p. 185-199.
TK7882.S65A33 1988
Pelton, Gordon E. Voice processing. New York, McGraw-Hill,
c1993. 396 p.
Bibliography: p. 375-379.
TK7882.S65P48 1993 <SciRR>
Rabiner, Lawrence R., and B. H. Juang. Fundamentals of speech
recognition. Englewood Cliffs, N.J., PTR Prentice Hall, c1993.
507 p.
Includes bibliographical references.
TK7895.S65R33 1993 <SciRR>
Readings in speech recognition. Edited by Alex Waibel
and Kai-Fu Lee. San Mateo, Calif., Morgan Kaufmann Publishers, c1990.
629 p.
Includes bibliographical references.
TK7882.S65R42 1990 <SciRR>
Voice communication between humans and machines. David
B. Roe and Jay G. Wilpon, editors. Washington, National Academy
Press, 1994. 548 p.
Includes bibliographical references.
Based on a colloquium sponsored by the National
Academy of Sciences.
TK7882.S65V62 1994 <SciRR>
Acero, Alejandro. Acoustical and environmental robustness
in automatic speech recognition. Boston, Kluwer Academic Publishers,
c1993. 186 p. (The Kluwer international series in engineering and
computer science, SECS 201)
Bibliography: p. 173-183.
TK7882.S65A24 1992
Advances in speech coding. Editors, Bishnu S. Atal, Vladimir
Cuperman, Allen Gersho. Boston, Kluwer Academic Publishers, c1991.
378 p. (The Kluwer international series in engineering and computer
science, SECS 114)
Papers based on presentations at the IEEE Workshop on Speech Coding
for Telecommunications, held in Vancouver, British Columbia, Canada,
from September 5 to 8, 1989.
Includes bibliographical references.
TK7882.S65A28 1991
Analysis and synthesis of speech: strategic research towards
high-quality text-to-speech generation. Edited by Vincent J.
van Heuven, Louis C. W. Pols. Berlin, New York, Mouton de Gruyter,
1993. 420 p. (Speech research, 11)
Bibliography: p. 365-406.
TK7882.S65A55 1993
Bourlard, Herve, and Nelson Morgan. Connectionist speech recognition:
a hybrid approach. Boston, Kluwer Academic Publishers, c1994.
312 p. (The Kluwer international series in engineering and computer
science, SECS 247)
Bibliography: p. 281-306.
TK7882.S65B69 1994
Church, Kenneth Ward. Phonological parsing in speech recognition.
Boston, Kluwer Academic Publishers, c1987. 261 p. (The Kluwer international
series in engineering and computer science, SECS 38)
Bibliography: p. 189-198.
TK7882.S65C49 1987
Flanagan, James L. Speech analysis: synthesis and perception.
2nd ed. Berlin, New York, Springer-Verlag, 1972. 444 p. (Kommunikation
und Kybernetik in Einzeldarstellungen, 3)
Bibliography: p. 406-426.
QP306.F57 1972
Furui, Sadaoki. Digital speech processing, synthesis, and recognition.
New York, Marcel Dekker, c1989. 390 p. (Electrical engineering and
electronics, 55)
Bibliography: p. 355-375.
TK7882.S65F87 1989
Huang, X. D., Y. Ariki, and Mervyn A. Jack. Hidden Markov models
for speech recognition. Edinburgh, Edinburgh University Press,
c1990. 276 p. (Edinburgh information technology series, 7)
Includes bibliographical references.
TK7882.S65H83 1990
Lee, Kai-Fu. Automatic speech recognition: the development
of the SPHINX system. Boston, Kluwer Academic Publishers, c1989.
207 p. (The Kluwer international series in engineering and computer
science, SECS 62)
Bibliography: p. 187-203.
TK7882.S65L44 1989
Markel, John D., and Augustine H. Gray. Linear prediction of
speech. Berlin, New York, Springer-Verlag, 1976. 288 p. (Communication
and cybernetics, 12)
Bibliography: p. 278-284.
TK7882.S65M37
Papamichalis, Panos E. Practical approaches to speech coding.
Englewood Cliffs, N.J., Prentice-Hall, c1987. 322 p.
Includes bibliographical references.
TK7882.S65P36 1987
Quackenbush, Schuyler R., T. P. Barnwell, and Mark A. Clements.
Objective measures of speech quality. Englewood Cliffs,
N.J., Prentice Hall, c1988. 377 p.
Includes bibliographical references.
TK7882.S65Q33 1988
Rabiner, Lawrence R., and Ronald W. Schafer. Digital processing
of speech signals. Englewood Cliffs, N.J., Prentice-Hall, c1978.
512 p.
Includes bibliographical references.
TK7882.S65R3
Riley, Michael D. Speech time-frequency representations.
Boston, Kluwer Academic Publishers, c1989. 160 p. (The Kluwer international
series in engineering and computer science, SECS 63)
Bibliography: p. 149-155.
TK7882.S65R55 1989
Russell, M. J. The development of the speaker independent ARM
continuous speech recognition system. London, Controller, H.M.S.O.,
c1992. 26 p. (Royal Signals and Radar Establishment memorandum,
no. 4473)
Bibliography: p. 24-26.
TK7895.S65R87 1992
Speech and audio coding for wireless and network applications.
Edited by Bishnu S. Atal, Vladimir Cuperman, Allen Gersho. Boston,
Kluwer Academic Publishers, c1993. 283 p.
Includes bibliographical references.
TK7882.S65S6318 1993
Speech input and output assessment: multilingual methods and
standards. Editors, A. Fourcin and others. Chichester, Eng.,
Horwood; New York, Halsted Press, 1989. 290 p.
Bibliography: p. 239-271.
TK7882.S65S664 1989
Speech and language processing. Edited by C. Wheddon and
R. Linggard. London, New York, Chapman and Hall, 1990. 339 p.
Includes bibliographical references.
TK7882.S65S632 1990
Visual representations of speech signals. Edited by Martin
Cooke, Steve Beet, and Malcolm Crawford. Chichester, Eng., New York,
J. Wiley & Sons, c1993. 385 p.
Includes bibliographical references.
QP306.V55 1993
Waibel, Alexander. Prosody and speech recognition. London,
Pitman; San Mateo, Calif., Morgan Kaufmann Publishers, 1988. 212
p.
Bibliography: p. 199-212.
TK7882.S65W45 1988
Yannakoudakis, E. J., and P. J. Hutton. Speech synthesis and
recognition systems. Chichester, Eng., Horwood; New York, Halsted
Press, 1987. 184 p.
Bibliography: p. 173-177.
TK7882.S65Y36 1987
Advances in speech, hearing, and language processing.
London, Greenwich, Conn., JAI Press, c1990- annual.
TK7882.S65A285
Cater, John P. Electronically hearing: computer speech recognition.
Indianapolis, H. W. Sams, c1984. 263 p.
Bibliography: p. 250-252.
TK7882.S65C35 1984
Deller, John R., John G. Proakis, and John H. L. Hansen. Discrete-time
processing of speech signals. New York, Macmillan, c1993. 908
p.
Bibliography: p. 852-897.
TK7882.S65D44 1993
Digital speech processing: speech coding, synthesis, and recognition.
Edited by A. Nejat Ince. Boston, Kluwer Academic Publishers, c1992.
242 p. (The Kluwer international series in engineering and computer
science, SECS 155)
Bibliography: p. 209-238.
TK7882.S65D54 1992
Electronic speech recognition: techniques, technology, and
applications. Edited by Geoff Bristow. New York, McGraw-Hill,
1986. 395 p.
Includes bibliographical references.
TK7882.S65E43 1986
Flanagan, James L. Speech synthesis. Edited by James L.
Flanagan and Lawrence R. Rabiner. Stroudsburg, Pa., Dowden, Hutchinson
& Ross, 1973. 511 p.
Includes bibliographical references.
TK7882.S65F55
Fundamentals of speech synthesis and speech recognition: basic
concepts, state of the art and future challenges. Edited by
Eric Keller. Chichester,
Eng., New York, Wiley, c1994. c300 p.
TK7882.S65F86 1994
Interactive speech technology: human factors issues in the
application of speech input/output to computers. Edited by
Christopher Baber and Janet M. Noyes. Bristol, Pa., Taylor &
Francis, c1993. 212 p.
Includes bibliographical references.
QA76.9.H85I593 1993
Papers in speech communication. Speech perception. Joanne
L. Miller, Raymond D. Kent, Bishnu S. Atal, editors. Woodbury, N.Y.,
Acoustical Society of America, c1991. 764 p.
Includes bibliographical references.
This volume is part of a three-volume series entitled
Papers in speech communication; the three volumes are Speech
production, Speech perception, and Speech processing; Published
by the Acoustical Society of America through the American Institute
of Physics.
BF463.S64P36 1991
Papers in speech communication. Speech processing. Bishnu
S. Atal, Joanne L. Miller, and Raymond D. Kent, editors. Woodbury,
N.Y., Acoustical Society of America, c1991. 655 p.
Includes bibliographical references.
This volume is part of a three-volume series entitled
Papers in speech communication; the three volumes are Speech
production, Speech perception, and Speech processing; Published
by the Acoustical Society of America through the American Institute
of Physics.
TK7882.S65P36 1991
Papers in speech communication. Speech production. Raymond
D. Kent, Bishnu S. Atal, Joanne L. Miller, editors. Woodbury, N.Y.,
Acoustical Society of America, c1991. 858 p.
Includes bibliographical references.
This volume is part of a three-volume series entitled
Papers in speech communication; the three volumes are Speech
production, Speech perception, and Speech processing; Published
by the Acoustical Society of America through the American Institute
of Physics.
QP306.P36 1991
Parsons, Thomas W. Voice and speech processing. New York,
McGraw-Hill, c1986. 402 p.
Includes bibliographical references.
TK7882.S65P37 1986
Prochnow, Dave. Chip talk: projects in speech synthesis.
Blue Ridge Summit, Pa., Tab Books, c1987. 209 p.
Bibliography: p. 197-199.
TK7882.S65P76 1987
Saito, Shuzo, and Kazuo Nakata. Fundamentals of speech signal
processing. New York, Academic Press, 1985. 266 p.
Includes bibliographical references.
TK7882.S65S24 1985
Sclater, Neil. Introduction to electronic speech synthesis.
Indianapolis, H. W. Sams, c1983. 134 p.
TK7882.S65S34 1983
Speech analysis. Edited by Ronald W. Schafer, John D.
Markel. New York, IEEE Press, c1979. 469 p.
Includes bibliographical references.
TK7882.S65S63
Speech processing. Edited by Chris Rowden. London, New
York, McGraw-Hill, c1992. 405 p.
Includes bibliographical references.
TK7882.S65S675 1991
Vysotsky, George. Automatic speech recognition research in
the USSR. Falls Church, Va., Delphic Associates, c1986. 143
p.
Bibliography: p. 139-143.
TK7882.S65V97 1986
Witten, I. H. Making computers talk: an introduction to speech
synthesis. Englewood Cliffs, N.J., Prentice-Hall, c1986. 150
p.
Includes bibliographical references.
TK7882.S65W55 1986
Zemlin, Willard R. Speech and hearing science: anatomy and
physiology. 3rd ed. Englewood Cliffs, N.J., Prentice-Hall,
1988. 603 p.
Includes bibliographical references.
QP306.Z44 1988 <SciRR>
DARPA Speech and Natural Language Workshop (3rd, 1990, Hidden Valley,
Pa.). Speech and natural language. Proceedings of a Workshop
held at Hidden Valley, Pennsylvania, June 24-27, 1990. Sponsored
by Defense Advanced Research Projects Agency, Information Science
and Technology Office. San Mateo, Calif.; Distributed by Morgan
Kaufmann Pub., Inc., 1990. 452 p.
Includes bibliographical references.
TK7895.S65D37 1990
ESCA Tutorial Day and Research Workshop on Speech Synthesis (1990,
Autrans, France). Talking machines: theories, models, and designs.
Edited by G. Bailly, C. Benoit, assisted by T. R. Sawallis. Amsterdam,
New York, North-Holland, 1992. 523 p.
Includes bibliographical references.
TK7882.S65E83 1990
ICASSP [International Conference on Acoustics, Speech, and Signal
Processing] Proceedings. New York, Institute of Electrical
and Electronics Engineers; Piscataway, N.J., Available from IEEE
Service Center, c1980- annual.
TK7882.S65I37a
NATO Advanced Study Institute on Recent Advances in Speech Understanding
and Dialog Systems (July 5-18, 1987, Bad Windsheim, Germany). Recent
advances in speech understanding and dialog systems. Edited
by Heinrich Niemann, M. Lang, Gerhard Sagerer. Berlin, New York,
Springer-Verlag, c1988. 521 p. (NATO ASI series. Series F, Computer
and system sciences, v. 46)
Includes bibliographical references.
TK7882.S65N36 1987
NATO Advanced Study Institute on Speech Recognition and Understanding:
Recent Advances, Trends, and Applications (July 1-13, 1990, Cetraro,
Italy). Speech recognition and understanding: recent advances,
trends, and applications. Edited by Pietro Laface, Renato De
Mori. Berlin, New York, Springer-Verlag, c1992. 559 p. (NATO ASI
series. Series F, Computer and systems sciences, vol. 75)
Includes bibliographical references.
TK7882.S65N38 1990
Voice Input/Output Applications Show and Conference. The official
proceedings of Speech Tech. New York, Media Dimensions, 1984-
annual.
TK7882.S65V63
Technical Reports and other types of literature may be identified
in the following abstracting and indexing journals:
Government Reports Announcements & Index (1946- )
Z7916.G78 <SciRR A&I> and CD-ROM <SciRR>
Monthly Catalog of United States Government Publications
(1895- )
Z1233.A18 <SciRR> and CD-ROM <MRR> <N&CPR>
Reports listed below are available in the microform collection
of the Science Reading Room. They and other reports on the subject
may also be purchased from the National Technical Information Service
(NTIS), Springfield, Virginia 22161. NTIS telephone: (703) 487-4650.
Bates, John K. Phoneme recognition system. Prepared for
the National Science Foundation, Small Business Innovation Research
Programs. Pleasantville, N.Y., VOIS, Inc., 1989. 48 p.
PB93-158046
Clements, Mark A., and others. Automatic recognition of speech
in stressful environments. Aberdeen Proving Ground, Md., U.S.
Army Human Engineering Laboratory, 1991. 71 p. (Technical memorandum
14-91)
AD-A242917
Danforth, Douglas G. An empirical investigation of sparse distributed
memory using discrete speech recognition. Prepared for the
National Aeronautics and Space Administration. Moffett Field, Calif.,
Research Institute for Advanced Computer Science, 1990. 29 p. (RIACS
Technical report, 90.18).
N92-124379
Olorenshaw, Lex, and David Trawick. Syntactic error modeling
and scoring normalization in speech recognition: error modeling
and scoring normalization in the speech recognition task for adult
literacy training. Prepared for the National Aeronautics and
Space Administration. Moffett Field, Calif., Research Institute
for Advanced Computer Science, 1991. 41 p. (NASA-CR-189938)
N92-20548
Ostendorf, Mari, and J. Robin Rohlicek. Segment-based acoustic
models for continuous speech recognition, Progress report: 1 October
92 - 30 September 93. Submitted to Office of Naval Research
and Advanced Research Projects Agency. Boston, Mass., Dept.
of Electrical Computer and Systems Engineering, Boston University,
1993. 12 p.
AD-A271483
Paul, Douglas B. An efficient stack decoder algorithm for continuous
speech recognition with a stochastic language model. Prepared
for Defense Advanced Research Projects Agency. Lexington, Mass.,
Massachusetts Institute of Technology, Lincoln Laboratory, 1991.
21 p. (Technical report 930)
AD-A240745
Russell, Martin J. A segmental hidden Markov model for speech
pattern processing. Prepared for Defence Research Information
Centre, Orpington, Eng. Malvern, Eng., Defence Research Agency,
1992. 36 p. (Royal Signals and Radar Establishment, memo 4599)
AD-A258220
Weinstein, Clifford J. Opportunities for advanced speech processing
in military computer-based systems. Prepared for Defense Advanced
Research Projects Agency. Lexington, Mass., Massachusetts Institute
of Technology, Lincoln Laboratory, 1991. 51 p. (Technical report
940)
AD-A233724
Abstracting and Indexing Services that index relevant journal articles
and other literature on speech recognition are listed below. Some
suggested terms are given as aids in searching. Note: Consult reference
librarian for location of abstracting and indexing services in the
Science Reading Room. The following indexes are available in most
public and college libraries:
ACM Guide to Computing Literature (1977-)
QA75.5.A75a <SciRR A&I>
See: Voice
Speech
Applied Science & Technology Index (1913-)
Z7913.I7 <SciRR A&I> <MRR> <N&CPR>
and CD-ROM <SciRR>
See: Automatic Speech Recognition
Automatic Speech Recognition Devices
Hidden Markov Models
Speech Processing Systems
Speech Synthesis
Artificial Intelligence Abstracts (1984-1992)
Q334.A76 <SciRR A&I>
See: Speech Recognition
Speech Synthesis
Voice Recognition
Computer Abstracts (1957-)
Z6654.C17C64 <SciRR A&I>
See: Speech Recognition
Voice Communication
Voice Recognition
Computer and Control Abstracts (1966-)
QA76.C548 <SciRR A&I> and CD-ROM <SciRR>
See: Section 12.50C Speech Recognition
Section 55.85 Speech Recognition and Synthesis
Computer and Information Systems Abstracts (1962-)
QA76.I46 <SciRR A&I>
See: Speech Recognition
Voice/Data Communication Systems
Computing Reviews (1960-)
QA76.C5854 <SciRR A&I>
See: Section B.4 Input/Output and Data Communication
Current Technology Index (1962-)
Z7913.B7 <SciRR A&I>
See: Section 11 Communication--Speech Sounds
Section 12 Control and Computer Technology--Computers--Speech Recognition
Machines
Engineering Index (1884-)
Z5851.E62 <SciRR A&I> and CD-ROM <SciRR>
See: Speech Analysis
Speech Coding
Speech Communication
Speech Intelligibility
Speech Processing
Speech Recognition
Speech Synthesis
Speech Transmission
Voice/Data Communication Systems
General Science Index (1978-)
Z7401.G46 <SciRR A&I> <MRR> <N&CPR>
See: Speech
Speech Processing Systems
Voice
Index Medicus (1960-)
Z6660.I42 <SciRR A&I> and CD-ROM <SciRR>
See: Headings beginning with Speech or Voice
Information Science Abstracts (1966-)
Z699.A1D6 <SciRR A&I>
See: Speech Recognition
Voice
Section on Communications and Telecommunications Systems
Magazine Index (1980-)
Available in several formats in LC
See: Speech Processing Systems
Speech Processing Systems Industry
Voice I O Device
Voice I O Equipment
Voice Recognition
Psychological Abstracts (1927-)
BF1.P65 <SciRR A&I> and CD-ROM <SciRR>
See: Speech Perception
Speech Processing (Mechanical)
Synthetic Speech
Readers' Guide to Periodical Literature (1900-)
AI3.R48 <B&E> <MRR> <N&CPR>
See: Speech Processing Systems
Science Abstracts. Series A, Physics Abstracts (1898-)
QC1.P46 <SciRR A&I> and CD-ROM <SciRR>
See: Section 8736 Speech and Biocommunications
Section 4370 Speech Communication
Science Abstracts. Series B, Electrical & Electronics Abstracts
(1898- )
Z5833.E37 <SciRR A&I> and CD-ROM <SciRR>
See: Section 6130 Speech Analysis and Processing
Techniques
Section 6200 Telecommunication
Journals that often contain articles relevant to speech recognition
include the following:
Acoustical Society of America. Journal
QC221.A4
ASR News [Automatic Speech Recognition News]
HD9696.S64A87
Computer Speech & Language
TK7882.S65C63
IEE Proceedings. I, Communications, Speech, and Vision
TK1.I13817
IEEE Signal Processing Magazine
TK5981.I143
IEEE Signal Processing Letters
TK5102.9.I32
IEEE Transactions on Signal Processing
TK5981.I2
IEEE Transactions on Speech and Audio Processing
TK7882.S65I38
Speech Recognition Update
TK7882.S65S677
Voice Processing Magazine
WMLC 93/4040
Voice Technology News
<N&CPR> Current Issues Only
Algazi, V. Ralph, and others. Transform representation of the spectra
of acoustic speech segments with applications - II: speech analysis,
synthesis and coding. IEEE transactions on speech and audio
processing, v. 1, July 1993: 277-286.
TK7882.S65I38
Bahl, Lalit R., and others. Multonic Markov word models for large
vocabulary continuous speech recognition. IEEE transactions
on speech and audio processing, v. 1, July 1993: 334-344.
TK7882.S65I38
Banks, D. Localisation and separation of simultaneous voices with
two microphones. IEE proceedings--I. Communications, speech
and vision, v. 140, Aug. 1993: 239-244.
TK1.I13817
Berger, Shaul, and others. An application specific DSP for speech
applications. IEEE transactions on consumer electronics,
v. 39, Nov. 1993: 733-739.
TK6563.I2
Bocchieri, E. L., and J. G. Wilpon. Discriminative feature selection
for speech recognition. Computer speech and language, v.
7, July 1993: 229-246.
TK7882.S65C63
Castello, Rinaldo, and Luciano Tomasini. A BiCMOS speech circuit
with only two external components. IEEE journal of solid-state
circuits, v. 28, July 1993: 770-777.
TK7871.85.I23
Chang, Pao-Chung, Sin-Horng Chen, and Biing-Hwang Juang. Discriminative
analysis of distortion sequences in speech recognition. IEEE
transactions on speech and audio processing, v. 1, July 1993:
326-333.
TK7882.S65I38
De Mori, Renato, and Giovanni Flammia. Speaker-independent consonant
classification in continuous speech with distinctive features and
neural networks. In Acoustical Society of America. Journal,
v. 94, Dec. 1993: 3091-3103.
QC221.A4
Gupta, V., and others. Use of minimum duration and energy contour
for phonemes to improve large vocabulary isolated-word recognition.
Computer speech and language, v. 6, Oct. 1992: 345-359.
TK7882.S65C63
Gupta, V., M. Lennig, and P. Mermelstein. A language model for
very large-vocabulary speech recognition. Computer speech and
language, v. 6, Oct. 1992: 331-344.
TK7882.S65C63
Lee, C.-H., and others. Improved acoustic modeling for large vocabulary
continuous speech recognition. Computer speech and language,
v. 6, Oct. 1992: 103-127.
TK7882.S65C63
Pai, Hsiao-Fen, and Hsiao-Chuan Wang. A study of two-dimensional
cepstrum approach for speech recognition. Computer speech and
language, v. 6, Oct. 1992: 361-375.
TK7882.S65C63
Schwin, Gene. Voice I/O for the industrial environment. Material
handling engineering, July 1993: 26.
TS149.F45
Snyder, J. H., and others. Tools for real-time signal-processing
research: new tools that facilitate the transfer of speech-processing
algorithms from mainframes or workstations to DSP hardware.
IEEE communications magazine, v. 31, Nov. 1993: 64-74.
TK5101.A1I13a
Speech recognition planned for more than 150 airport towers. Speech
recognition update, May 1993: 1, 3.
TK7882.S65S677
Selected materials available in the Science Reading Room pamphlet
boxes include the following:
Andrews, David L. IBM and Apple work to perfect voice input. Byte,
v. 18, Aug. 1993: 32.
Bertolucci, Jeff. Cheap talk: low-cost speech recognition.
PC world, v. 11, Feb. 1993: 72.
Computers are learning to listen. Today's science on file,
v. 1, Jan. 1993: 97-99.
Crane, Hewitt D., and Dimitry Rtischev. Pen and voice unite. Byte,
v. 18, Oct. 1993: 98, 99, 102.
Freyd, Gwendolyn. Friendly chats with computers. Technology
review, v. 95, May/June 1992: 13-14.
Hammond, G. T. Deploying speech technology in telephone banking
services. Speech technology, v. 5, Apr./May 1990: 30-33.
Levinson, Stephen E., Joseph P. Olive, and Judith S. Tschirgi.
Speech synthesis in telecommunications. IEEE communications
magazine, v. 31, Nov. 1993: 46-53.
Low-cost speech recognition for personal computers. Speech
recognition update, Jan. 1993: 1-6.
McCrone, John. Computers that listen. New scientist, v.
140, Dec. 4, 1993: 30-35.
Oberteuffer, John A. Applications of advanced speech technologies.
Voice processing magazine, v. 5, Apr. 1993: 56-62.
Pennisi, Elizabeth. Talking maps. Science news, v. 142,
Dec. 5, 1992: 392-393.
Peterson, Ivars. Speech for export. Science news, v. 144,
Oct. 16, 1993: 254-255.
Peterson, Ivars. Voices in command. Science news, v. 143,
Apr. 3, 1993: 222-223.
Poole, Lon. Useful voice processor for Mac 1.1v7. Macworld,
v. 10, Dec. 1993: 63, 65.
Pratt, David. Automatic voice recognition in the 1990s: with a
little help. Speech technology, v. 5, Apr./May 1990: 40.
Tobin, Paul E. On the future of voice technology in military applications.
Speech technology, v. 5, Apr./May 1990: 76-78.
Zimmerman, Robert. Computer ear. The Sciences, v. 34,
Jan./Feb. 1994: 9-10.
Signal Processing Society
c/o Institute of Electrical and Electronics Engineers
345 E. 47th St.
New York, NY 10017
Telephone: (212) 705-7900
FAX: (212) 705-4929
Acoustical Society of America
500 Sunnyside Blvd.
Woodbury, NY 11797
Telephone: (516) 576-2366
FAX: (516) 576-2377