2 repositories on SrcLog
Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format
Simple app for visual editing of Page XML files