DAWG2WORDLIST(1)
================
:doctype: manpage

NAME
----
dawg2wordlist - convert a Tesseract DAWG to a wordlist

SYNOPSIS
--------
*dawg2wordlist* 'UNICHARSET' 'DAWG' 'WORDLIST'

DESCRIPTION
-----------
dawg2wordlist(1) converts a Tesseract Directed Acyclic Word
Graph (DAWG) to a list of words using a unicharset as key.

OPTIONS
-------
'UNICHARSET'
    The unicharset of the language. This is the unicharset
    generated by mftraining(1).

'DAWG'
    The input DAWG, created by wordlist2dawg(1)

'WORDLIST'
    Plain text (output) file in UTF-8, one word per line

SEE ALSO
--------
tesseract(1), mftraining(1), wordlist2dawg(1), unicharset(5),
combine_tessdata(1)

<https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html>

COPYING
-------
Copyright \(C) 2012 Google, Inc.
Licensed under the Apache License, Version 2.0

AUTHOR
------
The Tesseract OCR engine was written by Ray Smith and his research groups
at Hewlett Packard (1985-1995) and Google (2006-present).