Upload a plain txt file (UTF-8)

Usage notes

Upload a Chinese plain text in order to estimate the century of it's creation, based on the lexeme types found in the text. Maximum upload file size is around 6 MB, but please expect processing times of several minutes for large amounts of text. Note that processing is based on character n-grams for reasons of performance and unavailability of reliable segmentation for texts of unknown period of creation.

A minimum text length of 1000+ words, or better a few thousand characters is recommended.

By uploading a text, you confirm that you permit me to temporarily store it on the server for the purpose of textual analysis, and that you own the right to do so.

Settings
  • Considered lexeme type length (字) 2–4
  • Keep punctuation True
  • Minimum token count for consideration 1
  • Use corpus training data from Loewe / 正史 True
  • Use additional corpus training data from 地方誌 True
  • Normalize variant characters to HYDCD standard True