In speech recognition, the primary input is an audio signal. A microphone captures the signal, which is then digitized and processed to extract features for recognizing the spoken words or phrases. These features describe properties of the sound wave such as its frequency content, amplitude, and duration. Machine learning models or other computational techniques then interpret them to identify what was said.
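As a rough illustration of feature extraction, the sketch below computes amplitude, dominant frequency, and duration from short overlapping frames of a signal. It is a toy example under stated assumptions, not a description of any particular recognizer: a synthetic 440 Hz tone stands in for microphone input, the 16 kHz sample rate and 25 ms / 10 ms frame and hop sizes are assumed typical values, and all function names are hypothetical. Practical systems usually rely on richer spectral features and optimized FFT routines rather than the naive DFT used here.

```python
import cmath
import math

SAMPLE_RATE = 16_000  # Hz; assumed value, commonly used for speech


def synth_tone(freq_hz, duration_s, amplitude=0.5, sample_rate=SAMPLE_RATE):
    """Generate a pure sine tone as a stand-in for captured microphone samples."""
    n = round(duration_s * sample_rate)
    return [amplitude * math.sin(2 * math.pi * freq_hz * t / sample_rate)
            for t in range(n)]


def rms_amplitude(frame):
    """Root-mean-square amplitude of one frame."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def dominant_frequency(frame, sample_rate=SAMPLE_RATE):
    """Frequency (Hz) of the strongest bin of a naive O(n^2) DFT."""
    n = len(frame)
    best_bin, best_mag = 0, 0.0
    for k in range(1, n // 2):  # skip the DC bin, stop below Nyquist
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(frame))
        if abs(coeff) > best_mag:
            best_bin, best_mag = k, abs(coeff)
    return best_bin * sample_rate / n


def frame_features(signal, frame_len=400, hop=160, sample_rate=SAMPLE_RATE):
    """Slice the signal into overlapping frames (25 ms long, 10 ms apart at
    16 kHz) and return one (rms_amplitude, dominant_frequency) pair per frame."""
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        feats.append((rms_amplitude(frame), dominant_frequency(frame, sample_rate)))
    return feats


signal = synth_tone(440.0, 0.05)          # 50 ms test tone
duration_s = len(signal) / SAMPLE_RATE    # duration recovered from sample count
features = frame_features(signal)

for amp, freq in features:
    print(f"amplitude={amp:.3f}  dominant_frequency={freq:.1f} Hz")
```

Each frame yields a small feature vector. A recognizer would pass a sequence of such vectors, usually far more detailed ones, to a model that maps them to words.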