OK, got it working here ...

You do need to disable T4 ints (last post).

And I haven't figured out why yet, but it was missing a bit in each byte, so I increased _bit_cntr to 9 and now it works.

I'll play with it some more and post the final results.
<br>