Mercurial > hg > audiostuff
diff spandsp-0.0.6pre17/src/spandsp/v17rx.h @ 4:26cd8f1ef0b1
import spandsp-0.0.6pre17
author | Peter Meerwald <pmeerw@cosy.sbg.ac.at> |
---|---|
date | Fri, 25 Jun 2010 15:50:58 +0200 |
parents | |
children |
line wrap: on
line diff
--- /dev/null Thu Jan 01 00:00:00 1970 +0000 +++ b/spandsp-0.0.6pre17/src/spandsp/v17rx.h Fri Jun 25 15:50:58 2010 +0200 @@ -0,0 +1,334 @@ +/* + * SpanDSP - a series of DSP components for telephony + * + * v17rx.h - ITU V.17 modem receive part + * + * Written by Steve Underwood <steveu@coppice.org> + * + * Copyright (C) 2003 Steve Underwood + * + * All rights reserved. + * + * This program is free software; you can redistribute it and/or modify + * it under the terms of the GNU Lesser General Public License version 2.1, + * as published by the Free Software Foundation. + * + * This program is distributed in the hope that it will be useful, + * but WITHOUT ANY WARRANTY; without even the implied warranty of + * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the + * GNU Lesser General Public License for more details. + * + * You should have received a copy of the GNU Lesser General Public + * License along with this program; if not, write to the Free Software + * Foundation, Inc., 675 Mass Ave, Cambridge, MA 02139, USA. + * + * $Id: v17rx.h,v 1.65 2009/07/09 13:52:09 steveu Exp $ + */ + +/*! \file */ + +#if !defined(_SPANDSP_V17RX_H_) +#define _SPANDSP_V17RX_H_ + +/*! \page v17rx_page The V.17 receiver +\section v17rx_page_sec_1 What does it do? +The V.17 receiver implements the receive side of a V.17 modem. This can operate +at data rates of 14400, 12000, 9600 and 7200 bits/second. The audio input is a stream +of 16 bit samples, at 8000 samples/second. The transmit and receive side of V.17 +modems operate independantly. V.17 is mostly used for FAX transmission over PSTN +lines, where it provides the standard 14400 bits/second rate. + +\section v17rx_page_sec_2 How does it work? +V.17 uses QAM modulation, at 2400 baud, and trellis coding. Constellations with +16, 32, 64, and 128 points are defined. After one bit per baud is absorbed by the +trellis coding, this gives usable bit rates of 7200, 9600, 12000, and 14400 per +second. + +V.17 specifies a training sequence at the start of transmission, which makes the +design of a V.17 receiver relatively straightforward. The first stage of the +training sequence consists of 256 +symbols, alternating between two constellation positions. The receiver monitors +the signal power, to sense the possible presence of a valid carrier. When the +alternating signal begins, the power rising above a minimum threshold (-43dBm0) +causes the main receiver computation to begin. The initial measured power is +used to quickly set the gain of the receiver. After this initial settling, the +front end gain is locked, and the adaptive equalizer tracks any subsequent +signal level variation. The signal is oversampled to 24000 samples/second (i.e. +signal, zero, zero, signal, zero, zero, ...) and fed to a complex root raised +cosine pulse shaping filter. This filter has been modified from the conventional +root raised cosine filter, by shifting it up the band, to be centred at the nominal +carrier frequency. This filter interpolates the samples, pulse shapes, and performs +a fractional sample delay at the same time. 192 sets of filter coefficients are used +to achieve a set of finely spaces fractional sample delays, between zero and +one sample. By choosing every fifth sample, and the appropriate set of filter +coefficients, the properly tuned symbol tracker can select data samples at 4800 +samples/second from points within 0.28 degrees of the centre and mid-points of +each symbol. The output of the filter is multiplied by a complex carrier, generated +by a DDS. The result is a baseband signal, requiring no further filtering, apart from +an adaptive equalizer. The baseband signal is fed to a T/2 adaptive equalizer. +A band edge component maximisation algorithm is used to tune the sampling, so the samples +fed to the equalizer are close to the mid point and edges of each symbol. Initially +the algorithm is very lightly damped, to ensure the symbol alignment pulls in +quickly. Because the sampling rate will not be precisely the same as the +transmitter's (the spec. says the symbol timing should be within 0.01%), the +receiver constantly evaluates and corrects this sampling throughout its +operation. During the symbol timing maintainence phase, the algorithm uses +a heavier damping. + +The carrier is specified as 1800Hz +- 1Hz at the transmitter, and 1800 +-7Hz at +the receiver. The receive carrier would only be this inaccurate if the link +includes FDM sections. These are being phased out, but the design must still +allow for the worst case. Using an initial 1800Hz signal for demodulation gives +a worst case rotation rate for the constellation of about one degree per symbol. +Once the symbol timing synchronisation algorithm has been given time to lock to the +symbol timing of the initial alternating pattern, the phase of the demodulated signal +is recorded on two successive symbols - once for each of the constellation positions. +The receiver then tracks the symbol alternations, until a large phase jump occurs. +This signifies the start of the next phase of the training sequence. At this +point the total phase shift between the original recorded symbol phase, and the +symbol phase just before the phase jump occurred is used to provide a coarse +estimation of the rotation rate of the constellation, and it current absolute +angle of rotation. These are used to update the current carrier phase and phase +update rate in the carrier DDS. The working data already in the pulse shaping +filter and equalizer buffers is given a similar step rotation to pull it all +into line. From this point on, a heavily damped integrate and dump approach, +based on the angular difference between each received constellation position and +its expected position, is sufficient to track the carrier, and maintain phase +alignment. A fast rough approximator for the arc-tangent function is adequate +for the estimation of the angular error. + +The next phase of the training sequence is a scrambled sequence of two +particular symbols. We train the T/2 adaptive equalizer using this sequence. The +scrambling makes the signal sufficiently diverse to ensure the equalizer +converges to the proper generalised solution. At the end of this sequence, the +equalizer should be sufficiently well adapted that is can correctly resolve the +full QAM constellation. However, the equalizer continues to adapt throughout +operation of the modem, fine tuning on the more complex data patterns of the +full QAM constellation. + +In the last phase of the training sequence, the modem enters normal data +operation, with a short defined period of all ones as data. As in most high +speed modems, data in a V.17 modem passes through a scrambler, to whiten the +spectrum of the signal. The transmitter should initialise its data scrambler, +and pass the ones through it. At the end of the ones, real data begins to pass +through the scrambler, and the transmit modem is in normal operation. The +receiver tests that ones are really received, in order to verify the modem +trained correctly. If all is well, the data following the ones is fed to the +application, and the receive modem is up and running. Unfortunately, some +transmit side of some real V.17 modems fail to initialise their scrambler before +sending the ones. This means the first 23 received bits (the length of the +scrambler register) cannot be trusted for the test. The receive modem, +therefore, only tests that bits starting at bit 24 are really ones. + +The V.17 signal is trellis coded. Two bits of each symbol are convolutionally coded +to form a 3 bit trellis code - the two original bits, plus an extra redundant bit. It +is possible to ignore the trellis coding, and just decode the non-redundant bits. +However, the noise performance of the receiver would suffer. Using a proper +trellis decoder adds several dB to the noise tolerance to the receiving modem. Trellis +coding seems quite complex at first sight, but is fairly straightforward once you +get to grips with it. + +Trellis decoding tracks the data in terms of the possible states of the convolutional +coder at the transmitter. There are 8 possible states of the V.17 coder. The first +step in trellis decoding is to find the best candidate constellation point +for each of these 8 states. One of thse will be our final answer. The constellation +has been designed so groups of 8 are spread fairly evenly across it. Locating them +is achieved is a reasonably fast manner, by looking up the answers in a set of space +map tables. The disadvantage is the tables are potentially large enough to affect +cache performance. The trellis decoder works over 16 successive symbols. The result +of decoding is not known until 16 symbols after the data enters the decoder. The +minimum total accumulated mismatch between each received point and the actual +constellation (termed the distance) is assessed for each of the 8 states. A little +analysis of the coder shows that each of the 8 current states could be arrived at +from 4 different previous states, through 4 different constellation bit patterns. +For each new state, the running total distance is arrived at by inspecting a previous +total plus a new distance for the appropriate 4 previous states. The minimum of the 4 +values becomes the new distance for the state. Clearly, a mechanism is needed to stop +this distance from growing indefinitely. A sliding window, and several other schemes +are possible. However, a simple single pole IIR is very simple, and provides adequate +results. + +For each new state we store the constellation bit pattern, or path, to that state, and +the number of the previous state. We find the minimum distance amongst the 8 new +states for each new symbol. We then trace back through the states, until we reach the +one 16 states ago which leads to the current minimum distance. The bit pattern stored +there is the error corrected bit pattern for that symbol. + +So, what does Trellis coding actually achieve? TCM is easier to understand by looking +at the V.23bis modem spec. The V.32bis spec. is very similar to V.17, except that it +is a full duplex modem and has non-TCM options, as well as the TCM ones in V.17. + +V32bis defines two options for pumping 9600 bits per second down a phone line - one +with and one without TCM. Both run at 2400 baud. The non-TCM one uses simple 16 point +QAM on the raw data. The other takes two out of every four raw bits, and convolutionally +encodes them to 3. Now we have 5 bits per symbol, and we need 32 point QAM to send the +data. + +The raw error rate from simple decoding of the 32 point QAM is horrible compared to +decoding the 16 point QAM. If a point decoded from the 32 point QAM is wrong, the likely +correct choice should be one of the adjacent ones. It is unlikely to have been one that +is far away across the constellation, unless there was a huge noise spike, interference, +or something equally nasty. Now, the 32 point symbols do not exist in isolation. There +was a kind of temporal smearing in the convolutional coding. It created a well defined +dependency between successive symbols. If we knew for sure what the last few symbols +were, they would lead us to a limited group of possible values for the current symbol, +constrained by the behaviour of the convolutional coder. If you look at how the symbols +were mapped to constellation points, you will see the mapping tries to spread those +possible symbols as far apart as possible. This will leave only one that is pretty +close to the received point, which must be the correct choice. However, this assumes +we know the last few symbols for sure. Since we don't, we have a bit more work to do +to achieve reliable decoding. + +Instead of decoding to the nearest point on the constellation, we decode to a group of +likely constellation points in the neighbourhood of the received point. We record the +mismatch for each - that is the distance across the constellation between the received +point and the group of nearby points. To avoid square roots, recording x2 + y2 can be +good enough. Symbol by symbol, we record this information. After a few symbols we can +stand back and look at the recorded information. + +For each symbol we have a set of possible symbol values and error metric pairs. The +dependency between symbols, created by the convolutional coder, means some paths from +symbol to symbol are possible and some are not. It we trace back through the possible +symbol to symbol paths, and total up the error metric through those paths, we end up +with a set of figures of merit (or more accurately figures of demerit, since +larger == worse) for the likelihood of each path being the correct one. The path with +the lowest total metric is the most likely, and gives us our final choice for what we +think the current symbol really is. + +That was hard work. It takes considerable computation to do this selection and traceback, +symbol by symbol. We need to get quite a lot from this. It needs to drive the error rate +down so far that is compensates for the much higher error rate due to the larger +constellation, and then buys us some actual benefit. Well in the example we are looking +at - V.32bis at 9600bps - it works out the error rate from the TCM option is like using +the non-TCM option with several dB more signal to noise ratio. That's nice. The non-TCM +option is pretty reasonable on most phone lines, but a better error rate is always a +good thing. However, V32bis includes a 14,400bps option. That uses 2400 baud, and 6 bit +symbols. Convolutional encoding increases that to 7 bits per symbol, by taking 2 bits and +encoding them to 3. This give a 128 point QAM constellation. Again, the difference between +using this, and using just an uncoded 64 point constellation is equivalent to maybe 5dB of +extra signal to noise ratio. However, in this case it is the difference between the modem +working only on the most optimal lines, and being widely usable across most phone lines. +TCM absolutely transformed the phone line modem business. +*/ + +/*! + V.17 modem receive side descriptor. This defines the working state for a + single instance of a V.17 modem receiver. +*/ +typedef struct v17_rx_state_s v17_rx_state_t; + +#if defined(__cplusplus) +extern "C" +{ +#endif + +/*! Initialise a V.17 modem receive context. + \brief Initialise a V.17 modem receive context. + \param s The modem context. + \param bit_rate The bit rate of the modem. Valid values are 7200, 9600, 12000 and 14400. + \param put_bit The callback routine used to put the received data. + \param user_data An opaque pointer passed to the put_bit routine. + \return A pointer to the modem context, or NULL if there was a problem. */ +SPAN_DECLARE(v17_rx_state_t *) v17_rx_init(v17_rx_state_t *s, int bit_rate, put_bit_func_t put_bit, void *user_data); + +/*! Reinitialise an existing V.17 modem receive context. + \brief Reinitialise an existing V.17 modem receive context. + \param s The modem context. + \param bit_rate The bit rate of the modem. Valid values are 7200, 9600, 12000 and 14400. + \param short_train TRUE if a short training sequence is expected. + \return 0 for OK, -1 for bad parameter */ +SPAN_DECLARE(int) v17_rx_restart(v17_rx_state_t *s, int bit_rate, int short_train); + +/*! Release a V.17 modem receive context. + \brief Release a V.17 modem receive context. + \param s The modem context. + \return 0 for OK */ +SPAN_DECLARE(int) v17_rx_release(v17_rx_state_t *s); + +/*! Free a V.17 modem receive context. + \brief Free a V.17 modem receive context. + \param s The modem context. + \return 0 for OK */ +SPAN_DECLARE(int) v17_rx_free(v17_rx_state_t *s); + +/*! Get the logging context associated with a V.17 modem receive context. + \brief Get the logging context associated with a V.17 modem receive context. + \param s The modem context. + \return A pointer to the logging context */ +SPAN_DECLARE(logging_state_t *) v17_rx_get_logging_state(v17_rx_state_t *s); + +/*! Change the put_bit function associated with a V.17 modem receive context. + \brief Change the put_bit function associated with a V.17 modem receive context. + \param s The modem context. + \param put_bit The callback routine used to handle received bits. + \param user_data An opaque pointer. */ +SPAN_DECLARE(void) v17_rx_set_put_bit(v17_rx_state_t *s, put_bit_func_t put_bit, void *user_data); + +/*! Change the modem status report function associated with a V.17 modem receive context. + \brief Change the modem status report function associated with a V.17 modem receive context. + \param s The modem context. + \param handler The callback routine used to report modem status changes. + \param user_data An opaque pointer. */ +SPAN_DECLARE(void) v17_rx_set_modem_status_handler(v17_rx_state_t *s, modem_rx_status_func_t handler, void *user_data); + +/*! Process a block of received V.17 modem audio samples. + \brief Process a block of received V.17 modem audio samples. + \param s The modem context. + \param amp The audio sample buffer. + \param len The number of samples in the buffer. + \return The number of samples unprocessed. +*/ +SPAN_DECLARE_NONSTD(int) v17_rx(v17_rx_state_t *s, const int16_t amp[], int len); + +/*! Fake processing of a missing block of received V.17 modem audio samples. + (e.g due to packet loss). + \brief Fake processing of a missing block of received V.17 modem audio samples. + \param s The modem context. + \param len The number of samples to fake. + \return The number of samples unprocessed. +*/ +SPAN_DECLARE(int) v17_rx_fillin(v17_rx_state_t *s, int len); + +/*! Get a snapshot of the current equalizer coefficients. + \brief Get a snapshot of the current equalizer coefficients. + \param s The modem context. + \param coeffs The vector of complex coefficients. + \return The number of coefficients in the vector. */ +#if defined(SPANDSP_USE_FIXED_POINTx) +SPAN_DECLARE(int) v17_rx_equalizer_state(v17_rx_state_t *s, complexi_t **coeffs); +#else +SPAN_DECLARE(int) v17_rx_equalizer_state(v17_rx_state_t *s, complexf_t **coeffs); +#endif + +/*! Get the current received carrier frequency. + \param s The modem context. + \return The frequency, in Hertz. */ +SPAN_DECLARE(float) v17_rx_carrier_frequency(v17_rx_state_t *s); + +/*! Get the current symbol timing correction since startup. + \param s The modem context. + \return The correction. */ +SPAN_DECLARE(float) v17_rx_symbol_timing_correction(v17_rx_state_t *s); + +/*! Get a current received signal power. + \param s The modem context. + \return The signal power, in dBm0. */ +SPAN_DECLARE(float) v17_rx_signal_power(v17_rx_state_t *s); + +/*! Set the power level at which the carrier detection will cut in + \param s The modem context. + \param cutoff The signal cutoff power, in dBm0. */ +SPAN_DECLARE(void) v17_rx_signal_cutoff(v17_rx_state_t *s, float cutoff); + +/*! Set a handler routine to process QAM status reports + \param s The modem context. + \param handler The handler routine. + \param user_data An opaque pointer passed to the handler routine. */ +SPAN_DECLARE(void) v17_rx_set_qam_report_handler(v17_rx_state_t *s, qam_report_handler_t handler, void *user_data); + +#if defined(__cplusplus) +} +#endif + +#endif +/*- End of file ------------------------------------------------------------*/