Prevent requests for previously provided iso tx packets

This bugfix grew out of an extended investigation into a problem encountered
by a small number of people running FFADO.  FFADO would report that the tx
iso cycle number supplied to the iso tx callback seemingly went backwards -
something which should not ordinarily occur.  The bug seemed to be sensitive
to timing and in some cases would disappear when debug traces were inserted
into either FFADO or libraw1394.  In essence, libraw1394 was requesting tx
data for cycles which had already been requested.

Initial discussions can be found in the thread "Problem with RME FF800. Can
not start jackd" on the ffado-user mailing list.  A followup investigation
is tracked in FFADO ticket number 379
(http://subversion.ffado.org/ticket/379) and referenced in the thread
"Revisiting backward cycle number sequence (ticket 379)" on ffado-devel.
The latter mailing list thread includes a lengthy explanation of what I
think is happening.

To summarise, the root of the problem seems to be that on certain machines
under certain conditions, something causes the kernel to post an iso tx
event at a time when fewer than irq_interval packets have been transmitted.
Unfortunately it has not been possible to determine the underlying cause of
this.  Whatever the cause, tests carried out with the reporter of ticket 379
have shown that it is occurring.  As a result, the adjustment to
libraw1394's packet_count must be done with reference to the number of
packets reported as transmitted by the kernel instead of simply assuming
that irq_interval packets have been sent.

A patch implementing this fix is at the end of this post.  This fixes the
problem when the newer ABI is in use, which provides tx packet timestamps
(and thus an indication of the number of packets actually transmitted) to
userspace.  It does not address the problem when the older ABI is used, but
given the nature of the problem I don't think it's possible to fix it
without access to the timestamps (or at least without some way to determine
the number of packets really transmitted).

Testing by "juanramon" (see ticket 379) has demonstrated that it fixes the
"backward cycle number" problem on his machine.

Thanks to Andreas Hehn and "juanramon" for their invaluable help in tracking
this down.

Signed-off-by: Jonathan Woithe <jwoithe@just42.net>
Signed-off-by: Stefan Richter <stefanr@s5r6.in-berlin.de>
This commit is contained in:
Jonathan Woithe 2014-06-10 23:10:57 +09:30 committed by Stefan Richter
parent 242ed444d3
commit c756fb8321
1 changed files with 2 additions and 2 deletions

View File

@ -322,10 +322,9 @@ static int handle_iso_event(raw1394handle_t handle,
{ {
int cycle; int cycle;
fwhandle->iso.packet_count -= fwhandle->iso.irq_interval;
/* Check whether the ABI version provides iso tx timestamps. */ /* Check whether the ABI version provides iso tx timestamps. */
if (interrupt->header_length) { if (interrupt->header_length) {
fwhandle->iso.packet_count -= interrupt->header_length/4;
/* /*
* Take the cycle of the last packet transmitted, add * Take the cycle of the last packet transmitted, add
* the number of packets currently queued, plus one, and * the number of packets currently queued, plus one, and
@ -335,6 +334,7 @@ static int handle_iso_event(raw1394handle_t handle,
cycle = be32_to_cpu(interrupt->header[interrupt->header_length/4 - 1]); cycle = be32_to_cpu(interrupt->header[interrupt->header_length/4 - 1]);
cycle &= 0x1fff; cycle &= 0x1fff;
} else { } else {
fwhandle->iso.packet_count -= fwhandle->iso.irq_interval;
/* /*
* Bogusly faking it again. Assume that the last packet * Bogusly faking it again. Assume that the last packet
* transmitted was transmitted on interrupt->cycle. * transmitted was transmitted on interrupt->cycle.