DeepWriter: A Multi-Stream Deep CNN for Text-independent Writer Identification

06/21/2016
by   Linjie Xing, et al.
0

Text-independent writer identification is challenging due to the huge variation of written contents and the ambiguous written styles of different writers. This paper proposes DeepWriter, a deep multi-stream CNN to learn deep powerful representation for recognizing writers. DeepWriter takes local handwritten patches as input and is trained with softmax classification loss. The main contributions are: 1) we design and optimize multi-stream structure for writer identification task; 2) we introduce data augmentation learning to enhance the performance of DeepWriter; 3) we introduce a patch scanning strategy to handle text image with different lengths. In addition, we find that different languages such as English and Chinese may share common features for writer identification, and joint training can yield better performance. Experimental results on IAM and HWDB datasets show that our models achieve high identification accuracy: 99.01 one English sentence input, 93.85 input, which outperform previous methods with a large margin. Moreover, our models obtain accuracy of 98.01 as input.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset