Published Articles

At the time of writing, I’ve jointly edited one book, written another, had well over 40 conference and journal papers published, and written quite a few other reports and presentations. I’ve also had some circuits published in trade journals and a letter in the Radio Times.

Anyway, all the important stuff is listed below; feel free to contact me if you want copies of anything. If you follow the links on the titles, you’ll find Zip files containing Adobe Acrobat versions of some of the more interesting papers. Actually, they’re all interesting (if you’re in the right frame of mind) – I just haven’t got electronic copies of some of them.

Book:

Visual Representations of Speech Signals. M. P. Cooke, S. W. Beet and M. D. Crawford (editors). John Wiley and Sons, Ltd., 1993.

Ph.D. Thesis:

Digital Processing of Speech Produced in Hyperbaric Helium. University of Liverpool (UK). S. W. Beet. September 1985.

Articles:

Making helium speech intelligible. S. W. Beet and C. C. Goodyear. IEE Electronics Division Colloquium Digest No. 1983/31, pp 11/1-5. April 1983.

Helium speech processor using linear prediction. S. W. Beet and C. C. Goodyear. Electronics Letters, Vol. 19, No. 11, pp 408-410. May 1983.

Speech formant shifts in hyperbaric helium. S. W. Beet and C. C. Goodyear. Proceedings of the IoA Spring Conference, Acoustics-84, Swansea, April 1984, pp 415-419.

The acoustic flow of speech. R. K. Moore, M. J. Tomlinson and S. W. Beet. Proceedings of the IoA, Vol. 6, Pt. 4, pp 241-248. 1984.

Auditory modelling for automatic speech recognition. S. W. Beet, R. K. Moore and M. J. Tomlinson. Proceedings of the IoA, Vol. 8, Pt. 7, pp 571-579. 1986.

Improved speech recognition using a reduced auditory representation. S. W. Beet, H. E. G. Powrie, R. K. Moore and M. J. Tomlinson. Proceedings of the IEEE conference ICASSP-88, New York, April 1988, pp 75-78.

Automatic speech recognition using a reduced auditory representation and position-tolerant discrimination. S. W. Beet. Computer Speech and Language, Vol. 4, pp 17-33. January 1990.

Optimising time and frequency resolution in the Reduced Auditory Representation. S. W. Beet and I. R. Gransden. Proceedings of the ESCA Workshop “Comparing Speech Signal Representations”, Sheffield, April 1992, pp 101-108.

Interfacing an auditory model to a parametric speech recogniser. S. W. Beet and I. R. Gransden. Proceedings of the IoA, Vol. 14, Pt. 6, pp 321-328. 1992.

The time and frequency resolution of an auditory representation of speech. S. W. Beet and I. R. Gransden. In “Visual Representations of Speech Signals”, eds.: M. P. Cooke, S. W. Beet and M. D. Crawford, pp 175-179. John Wiley and Sons, Ltd., 1993.

Computationally efficient methods for calculating instantaneous frequency for auditory analysis. I. R. Gransden and S. W. Beet. Proceedings of the ESCA conference Eurospeech ’93, Berlin, September 1993, pp 385-388.

Adaptive control using radial basis function networks. P. A. Moakes and S. W. Beet. Proceedings of the IEE conference Control ’94, Brighton, March 1994, pp 1453-1457.

Flow-based prediction: a method for improved speech recognition. L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. IEE Electronics Division Colloquium Digest No. 1994/138, pp 5/1-5. June 1994.

Non-linear speech analysis using recurrent radial basis function networks. P. A. Moakes and S. W. Beet. In “Neural Networks for Signal Processing IV”, eds.: J. Vlontzos, J-N. Hwang and E. Wilson, pp 319-328. IEEE Press, 1994.

Non-stationary prediction of frame-based speech data. S. W. Beet, L. Baghai-Ravary and M. O. Tokhi. In “Signal Processing VII: Theories and applications”, eds.: M. Holt, C. Cowan, P. Grant and W. Sandham, pp 1649-1652. EURASIP, 1994.

The noise robustness of auditory front-ends in HMM based speech recognisers. I. R. Gransden and S. W. Beet. In “Signal Processing VII: Theories and applications”, eds.: M. Holt, C. Cowan, P. Grant and W. Sandham, pp 1653-1656. EURASIP, 1994.

Analysis of non-linear speech generating dynamics. P. A. Moakes and S. W. Beet. Proceedings of the Acoustical Society of Japan conference ICSLP ’94, Yokohama, September 1994, pp 1039-1042.

Combining auditory representations using fuzzy sets. I. R. Gransden and S. W. Beet. Proceedings of the Acoustical Society of Japan conference ICSLP ’94, Yokohama, September 1994, pp 1047-1050.

Removing redundancy from some common representations of speech. L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. Proceedings of the IoA, Vol. 16, Pt. 5, pp 467-474. 1994.

Combining auditory representations. I. R. Gransden and S. W. Beet. Refereed paper, Proceedings of the IoA, Vol. 16, Pt. 5, pp 191-198. 1994.

Recurrent radial basis functions for speech period detection. P. A. Moakes and S. W. Beet. Refereed paper, Proceedings of the IoA, Vol. 16, Pt. 5, pp 271-278. 1994.

Radial basis function networks for noise reduction of speech. P. A. Moakes and S. W. Beet. Proceedings of the 4th IEE International Conference on Artificial Neural Networks, Cambridge, June 1995, pp 7-12.

Adaptive flux interpolation, flow-based prediction, delta or delta-delta coefficients: which is best? L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. Proceedings of the ESCA conference Eurospeech ’95, Madrid, September 1995, pp 1037-1040.

Automatic analysis of individual voice characteristics. S. W. Beet, P. A. Cudd, S. P. Whiteside and J. E. H. Noad. Proceedings of the IPSM & BES Annual Conference, IBEX ’95, Sheffield, September 1995, p 115.

Stripwise image warping. L. Baghai-Ravary and S. W. Beet. Proceedings of ICSPAT ’95, Boston, September 1995, pp 954-958.

Towards the automation of personal voices for VOCAs. S. W. Beet, S. P. Whiteside, D. H. Li, P. A. Cudd and J. E. H. Noad. Proceedings of ECART-3, Lisbon, October 1995, pp 155-157.

Active shape models for visual speech feature extraction. J. Luettin, N. A. Thacker and S. W. Beet. In “Speechreading by Humans and Machines”, eds.: D. G. Stork and M. E. Hennecke, pp 383-390. Springer Verlag, 1996.

The two-dimensional discrete cosine transform applied to speech data. L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. Proceedings of the IEEE conference ICASSP-96, Atlanta, May 1996, pp 244-247.

Visual speech recognition using active shape models and hidden Markov models. J. Luettin, N. A. Thacker and S. W. Beet. Proceedings of the IEEE conference ICASSP-96, Atlanta, May 1996, pp 817-820.

Towards a better auditory representation for speech recognition. S. W. Beet and L. Baghai-Ravary. Proceedings of the ESCA Tutorial and Research Workshop “The Auditory Basis of Speech Perception”, Keele, July 1996, pp 287-290.

Modelling the flux inherent in speech representations. L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. Applied Signal Processing, Vol. 3, pp 169-183. 1995.

Learning to recognise talking faces. J. Luettin, N. A. Thacker and S. W. Beet. Proceedings of the 13th IAPR International Conference on Pattern Recognition, Vienna, August 1996, pp 55-59.

Locating and tracking facial speech features. J. Luettin, N. A. Thacker and S. W. Beet. Proceedings of the 13th IAPR International Conference on Pattern Recognition, Vienna, August 1996, pp 652-656.

Statistical lip modelling for visual speech recognition. J. Luettin, N. A. Thacker and S. W. Beet. In “Signal Processing VIII: Theories and applications”, eds.: G. Ramponi, G. L. Sicuranza, S. Carrato and S. Marsi, pp 137-140. EURASIP, 1996.

Recognition of phonemes from estimation errors. L. Baghai-Ravary and S. W. Beet. In “Signal Processing VIII: Theories and applications”, eds.: G. Ramponi, G. L. Sicuranza, S. Carrato and S. Marsi, pp 1627-1630. EURASIP, 1996.

Speechreading using shape and intensity information. J. Luettin, N. A. Thacker and S. W. Beet. Proceedings of the Applied Science and Engineering Laboratories conference ICSLP ’96, Philadelphia, October 1996, pp 58-61.

Speaker identification by lipreading. J. Luettin, N. A. Thacker and S. W. Beet. Proceedings of the Applied Science and Engineering Laboratories conference ICSLP ’96, Philadelphia, October 1996, pp 62-65.

Estimating child and adolescent formant frequency values from adult data. P. Martland, S. P. Whiteside, S. W. Beet and L. Baghai-Ravary. Proceedings of the Applied Science and Engineering Laboratories conference ICSLP ’96, Philadelphia, October 1996, pp 622-625.

Analysis of ten vowel sounds across gender and regional/cultural accent. P. Martland, S. P. Whiteside, S. W. Beet and L. Baghai-Ravary. Proceedings of the Applied Science and Engineering Laboratories conference ICSLP ’96, Philadelphia, October 1996, pp 2231-2234.

Continuous adaptation of linear models with impulsive excitation. S. W. Beet and L. Baghai-Ravary. Proceedings of the Applied Science and Engineering Laboratories conference ICSLP ’96, Philadelphia, October 1996, pp 2250-2253.

Image coding by multi-step adaptive flux interpolation. L. Baghai-Ravary, S. W. Beet and M. O. Tokhi. IEE Proceedings on Vision, Image and Signal Processing, Vol. 143, pp 337-343. December 1996.

Automatic segmentation: data-driven units of speech. S. W. Beet and L. Baghai-Ravary. Proceedings of the ESCA conference Eurospeech ’97, Rhodos, September 1997, pp 505-508.

The Future of Telephone Speech Recognition. L. Baghai-Ravary and S. W. Beet. Proceedings of the 2nd Advanstar Speech Technology Congress, Voice Germany ’98, Köln, June 1998.

Multi-step coding of speech parameters for compression. L. Baghai-Ravary and S. W. Beet. IEEE Transactions on Speech and Audio Processing, Vol. 6, pp 435-444. September 1998.

Just how big is the difference between text-to-speech and text-to-telephone speech systems? S. W. Beet and L. Baghai-Ravary. Proceedings of the 3rd Advanstar Speech Recognition Symposium, Voice Europe ’98, London, October 1998.

What Use is Telephone Speech Synthesis? S. W. Beet. Voice+, Vol. 6, No. 7, pp 78-83. October 1999.

What Use is Telephone Speech Synthesis? S. W. Beet. Proceedings of the 4th Advanstar Speech Recognition Symposium, Voice Europe ’99, London, November 1999.

Automatic telephone voice analysis and therapy. L. Baghai-Ravary and S. W. Beet. UK Speech Conference Abstracts, Cambridge, September 2017, p 23.

VoiScan: Telephone Voice Analysis for Health and Biometric Applications. L. Baghai-Ravary and S. W. Beet. In “Speech and Computer” (SPECOM 2017), eds.: A. Karpov, R. Potapova and I. Mporas, Lecture Notes in Computer Science, Vol. 10458. Springer, 2017.




All contents of this site copyright © 1996 – 2018 Steve Beet
Last modified: 23:10, Saturday, 13 January, 2018