Contents & References of Speech signal enhancement in the time-frequency domain
List:
Acknowledgment and thanks.
Abstract. H
List of figures. Q
List of tables. P.
Chapter One: 1
Introduction. 1
1-1 Preface. 1
1-2 Improvement of noisy speech: goals, applications, concepts. 2
1-3 defining the problem and dividing the methods. 3
1-4 research innovation. 4
1-5 Thesis structure. 4
The second chapter. 5
backgrounds of speech signal processing. 5
2-1 How to produce speech in humans. 5
2-2 Introducing noise and its types. 10
2-2-1 white noise. 13
2-2-2 Pink noise. 13
2-2-3 brown noise. 14
2-2-4 Industrial noise. 14
2-3 Time-frequency analysis of the speech signal. 15
2-3-1 Fourier transform. 15
2-3-2 Short-time Fourier transform. 17
2-3-3 multi-precision time-frequency analysis. 20
2-3-4 one-dimensional wavelet transform. 20
2-3-4-1 continuous wavelet transform. 20
2-3-4-1-1 time and frequency accuracies. 22
2-3-4-1-2 Mathematical relationships of wavelet transform: 22
2-3-4-1-3 Picture of wavelet transform: 24
2-3-4-2 discrete wavelet transform. 24
2-4 genetic optimization algorithm. 28
2-4-1 about the science of genetics. 28
2-4-2 The history of genetic science. 29 2-4-3 Natural evolution (Darwin's law of natural selection) and its relationship with artificial intelligence methods 29 2-4-4 Genetic algorithm. 32
2-4-5 genetic algorithm mechanism. 34
2-4-6 genetic algorithm operations. 37
2-4-6-1 coding. 37
2-4-6-2 Evaluation. 37
2-4-6-3 combination. 37
2-4-6-4 jump. 37
2-4-6-5 decoding. 38
2-4-7 Algorithm chart with its pseudo code. 38
2-4-7-1 pseudo code and its explanation. 38
2-4-7-2 genetic algorithm chart. 40
2-4-8 objective function. 41
2-4-9 coding methods. 41
2-4-9-1 Binary coding. 42
2-4-9-2 substitution coding. 42
2-4-9-3 value coding. 43
2-4-9-4 coding tree. 44
2-4-10 Showing strings. 45
2-4-11 population. 46
2-4-11-1 Creating the initial population. 46
2-4-11-2 population size. 46
2-4-12 calculation of fitness (value function) 47
2-4-13 types of selection methods. 48
2-4-13-1 Roulette wheel selection. 49
2-4-13-2 Selection of steady state. 51
2-4-13-3 The choice of elitism. 51
2-4-13-4 competitive selection. 51
2-4-13-5 Choosing to cut off the head. 52
2-4-13-6 Brindle's definitive choice. 52
2-4-13-7 Choice of modified generation replacement. 53
2-4-13-8 Match selection. 53
2-4-13-9 Random match selection. 53
2-4-14 Types of composition methods. 53
2-4-14-1 binary transfer. 54
2-4-14-2 Real displacement. 56
2-4-14-3 single point combination. 57
2-4-14-4 two-point combination. 58
2-4-14-5 n point combination. 58
2-4-14-6 uniform composition. 58
2-4-14-7 Arithmetic composition. 59
2-4-14-8 order. 59
2-4-14-9 cycles. 60
2-4-15 possibility of combination. 60
2-4-16 Analysis of displacement mechanism. 61
2-4-17 mutation. 61
2-4-17-1 Binary mutation. 63
2-4-17-2 True mutation. 64
2-4-17-3 Bit inversion. 64
2-4-17-4 Changing the placement order. 64
2-4-17-5 Inversion. 64
2-4-17-6 Value change. 65
2-4-18 The benchmark for the conclusion of the genetic algorithm implementation. 65
2-4-19 strengths of genetic algorithms. 66
2-4-20 Limitations of GAs. 68
2-5 Analysis of linear prediction coefficients (LPC) 69
2-5-1 Calculation of LPC coefficients. 70
The third chapter. 73
A review of major methods of speech improvement. 73
3-1 Introduction. 73
3-2 Spectral subtraction method. 74
3-3 Wiener filter method. 76
3-4 speech improvement using statistical models. 78
3-4-1 Logarithmic estimator based on minimizing mean square error (Log MMSE) 78
3-4-2 Using hidden Markov model (HMM) for speech enhancement. 80
3-5 methods under the signal space. 82
3-6 Speech enhancement using wavelet transform. 83
3-7 Comparison of methods and examination of strengths and weaknesses. 85
3-7-1 Review85
3-7-1 Comparative studies conducted between some speech optimization methods 86
2-3-2 A summary of the characteristics and strengths and weaknesses of different methods. 87
3-8 important points and considerations in the design of the speech improvement system. 89
3-8-1 Use of combined systems. 89
3-8-2 Use of sub-band processing and its benefits. 89
3-8-3 Using the second microphone. 90
Chapter Four: Suggested methods. 92
4-1 Introduction. 92
2-4 Suggested methods. 93
4-2-1 Improving audio signals using genetic algorithm and LPC analysis in wavelet subtraction method. 93
4-2-1-1 Wavelet coefficients spectral subtraction method (WSS) 94
4-2-1-2 Modification of the wavelet coefficients spectral subtraction method (IWSS) 95
4-2-1-3 Noise estimation. 96
4-2-1-4 genetic algorithm. 97
4-2-1-4-1 Selection operator. 97
4-2-1-4-2 cutting operator. 98
4-2-1-4-3 mutation operator. 98
4-2-1-4-4 initial population. 98
4-2-1-4-5 objective function. 98
4-2-2 Improvement of audio signals using mean square error method in wavelet space 98
4-2-2-1 Log MMSE estimator in wavelet space. 99
4-2-2-2 noise estimation. 100
Chapter Five: Results and Experiments. 101
5-1 Introduction. 101
5-2 Implementation details. 102
3-5 results of audio signal improvement using genetic algorithm and LPC analysis in wavelet subtraction method. 103
4-5 results of sound signal improvement using the mean square error method in wavelet space 106
Chapter 6: conclusions and suggestions. 109
6-1 Conclusion. 109
6-2 Suggestions for future work. 111
References 112
Source:
[1] P.C. Loizou, Speech Enhancement: Theory and Practice, CRC Press, Boca Raton, FL, 2007. section 3, p 46
[2] Borden, G., Harris, K., and Raphael, L., Speech science Primier, 3rd ed., Baltimore, MID: Williams and Wilkins.
[3] Rosenberg, A., Effect of glottal pulse shape on the quality of natural vowels, J. Acoust. Soc. Am., 49(2), 583-588 (1971).
[4] S. G. Mallat, "A Theory for Multiresolution Signal Decomposition: The Wavelet Representation," IEEE Transactions on Pattern Analysis Machine Intelligence, July 1989.
[5] A. Sayadi, "Preliminary Introduction to Wavelet Transform," School of Electrical Engineering, Sharif University of Technology, Isfand 87.
[6] C. S. Burrus, R. A. Gopinath and H. Guo, Introduction to Wavelets and the Wavelet Transforms: A Primer. Upper Saddle River, Prentice Hall, Inc., 1998.
[7] www.fa.wikipedia.org/wiki/ zhentik.htm
[8] www.en.wikipedia.org/wiki/Gergor_Mandel.htm
[9] 1387 Abid, Tehran, "Genetic Algorithms and Optimization of Composite Structures", Bavari A, Salehi M.
[10] www.fa.wikipedia.org/wiki/Algorithm_Jhentik.htm
[11] JamJam Newspaper - Click Addendum - Number 209, Tehran, "Research on Genetic Algorithm", Shahmiri A 1387
[12] "Applicable training of Genetic Algorithm in MATLAB software" Rezaei A, Ranjbaran 1386, Tehran, Azar Publications
[13] D.A Coley.,"An Introduction to genetic Algorithms for scientists and engineers", word scientific, 2000
[14] Goldberg D.E,"Genetic Algorithms in Search Optimization and Machine Learning", Addisson Wesley Longman Inc., 1997.
[15] S. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Process, 1979.
[16] M. Berouti, R. Schwartz, and J. Makhoul, enhancement of speech corrupted by acoustic noise, proc. IEEE ICASSP, Washington DC, April 1979, 208-211.
[17] Zenton Goh, Kah-Chye Tan, B T G Tan, "Postprocessing Method for Suppressing Musical Noise Generated by Spectral Subtraction" IEEE trans. on Speech and Audio Processing 1998
[18] Bodin, P. and Villemoes, L. F. (1997). Spectral subtraction in the time-frequency domain using wavelet packets. IEEE Workshop on Speech Coding for Telecommunications. Proceedings, New York, IEEE, 47{48.