Once we manage to recognize a one-handed gesture, expand to two hands. This involves more feature selection (relative x, y, z positions? Relative rotations?)

Neural network will require changes to accept a larger input (duplicate the same measurements for left hand, plus any relative position inputs you think would help), and preprocessing will probably become integral at this point.