tokenizer/cindrella_stories.txt tokenizer/README.md __pycache__