Text this: A smart vision system for early driver drowsiness detection using a CNN–transformer hybrid model with attention-based learning