How does the extract feature determine what text to return?
Example:
If the valid alphabet was:
ehlo
and the file contents were:
" hello, how the hell are you hell hehe"
FileMonkey would first strip all characters that do not appear in the valid alphabet, which would give the following list of words:
hello ho he hell e o hell hehe
If the minimum keyword length is 3, FileMonkey would ignore all keywords of smaller length and return the words:
hello hell hell hehe
If the maximum keyword length is 4, FileMonkey would ignore all keywords of larger length and return the words:
hell hell hehe
Then, if you are stripping repeats, FileMonkey would return:
hell hehe
And if your keyword pattern was ?e?e, then FileMonkey would return:
hehe
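
The steps in this example can be sketched in Python as follows. This is only an illustration, not FileMonkey's actual code: the function name extract_keywords and its parameters are hypothetical, and it assumes that every character outside the valid alphabet ends the current word and that "?" in a keyword pattern stands for exactly one character.

```python
from itertools import groupby


def extract_keywords(text, alphabet, min_len=1, max_len=None,
                     strip_repeats=False, pattern=None):
    """Hypothetical sketch of the extraction steps described above."""
    valid = set(alphabet)

    # Step 1: keep only runs of valid characters; any other character
    # (assumed here to act as a separator) ends the current word.
    words = ["".join(run)
             for is_valid, run in groupby(text, key=lambda c: c in valid)
             if is_valid]

    # Step 2: ignore keywords shorter than the minimum length.
    words = [w for w in words if len(w) >= min_len]

    # Step 3: ignore keywords longer than the maximum length, if one is set.
    if max_len is not None:
        words = [w for w in words if len(w) <= max_len]

    # Step 4: optionally strip repeats, keeping the first occurrence of each.
    if strip_repeats:
        words = list(dict.fromkeys(words))

    # Step 5: optionally keep only keywords matching the pattern, where "?"
    # is assumed to match any single character and all else matches literally.
    if pattern is not None:
        words = [w for w in words
                 if len(w) == len(pattern)
                 and all(p == "?" or p == c for p, c in zip(pattern, w))]

    return words


contents = " hello, how the hell are you hell hehe"
print(extract_keywords(contents, "ehlo", min_len=3, max_len=4,
                       strip_repeats=True, pattern="?e?e"))
# prints ['hehe']
```

Calling the function with only the alphabet, and then adding one option at a time, reproduces each intermediate list in the example.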