gokulnkreading listnotes

Term frequency-inverse document frequency (TF-IDF)

Information density: compare a term's frequency in the document with its frequency across the whole corpus. If a term shows up in all documents, it is probably not very distinctive for any one of them.

Relative frequency: a term's count divided by the total number of terms in the document.
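A minimal sketch of the idea in Python, assuming whitespace tokenization, relative frequency as the term-frequency part, and the plain log(N / df) form for the inverse document frequency. The function name `tf_idf` and the toy corpus are made up for illustration; real libraries usually add smoothing and normalization on top of this.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per document (hypothetical helper)."""
    # Assumption: lowercase + whitespace split is good enough as a tokenizer.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term occur?
    df = Counter(term for tokens in tokenized for term in set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            # relative frequency in this document * log(N / document frequency)
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores

# Toy corpus, invented for this example.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]
scores = tf_idf(docs)
```

With this unsmoothed form, a word like "the" that occurs in every document gets a score of exactly zero, while a word that occurs in only one document (here "mat") scores highest within that document.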

TODO

  1. todo
  2. Look at how often the word "undefined" appears in internet texts. Prompting an LLM might be a nice way of doing this.
  3. BM25

    © 2026, Site By @gokulnk