Find a source that gives all 700 verses across the 18 adhyayas (chapters).
Use ChatGPT to transliterate these slokas from Devanagari (the script shared by Hindi and Sanskrit) into Kannada and Telugu.
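As an alternative to ChatGPT, script-to-script transliteration can be sketched with the standard library alone, because the Devanagari, Kannada, and Telugu Unicode blocks are laid out in parallel. The function name `transliterate` and the `SCRIPT_BASE` table are illustrative, not part of any library. This is a rough sketch: a few Devanagari code points (ॐ, for example) may have no counterpart at the same offset, so the output should be proofread.

```python
# Start of each script's Unicode block; the Indic blocks share a parallel layout.
SCRIPT_BASE = {"devanagari": 0x0900, "kannada": 0x0C80, "telugu": 0x0C00}
BLOCK_SIZE = 0x80

# Danda and double danda are encoded only in the Devanagari block and are
# reused as-is by the other scripts, so they are left untouched.
SHARED = {0x0964, 0x0965}


def transliterate(text, source="devanagari", target="kannada"):
    """Shift every character of the source block to the same offset in the target block."""
    src, dst = SCRIPT_BASE[source], SCRIPT_BASE[target]
    out = []
    for ch in text:
        cp = ord(ch)
        if src <= cp < src + BLOCK_SIZE and cp not in SHARED:
            out.append(chr(cp - src + dst))
        else:
            # Spaces, punctuation, and anything outside the source block pass through.
            out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    line = "धर्मक्षेत्रे कुरुक्षेत्रे"
    print(transliterate(line, target="kannada"))
    print(transliterate(line, target="telugu"))
```

The Indic NLP Library listed below offers a transliterator built on the same idea, which is probably the better choice once the library is installed.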
Tat pada artha: break complex words into their simpler component words. Candidate tools:
- Sanskrit Corpus (CLTK): The Classical Language Toolkit (CLTK) is a Python library that provides a suite of text processing libraries for classical languages, including Sanskrit. It provides tools for a variety of tasks, including tokenization, part-of-speech tagging, and syntactic parsing.
- Sanskrit-Data: A Python library for Sanskrit linguistic data processing. It provides tools for processing Sanskrit words, including segmentation and normalization.
- SanskritNet Morphological Analyzer: A tool to perform morphological analysis of Sanskrit words. This can be useful for breaking down complex words into their constituent parts.
- Indic NLP Library: The Indic NLP library provides a suite of NLP tools for Indian languages, including Sanskrit. It includes tools for tokenization, transliteration, and other processing tasks.
- Sanskrit Segmenter: A tool specifically designed for segmenting Sanskrit words into morphemes.
Get the list of all the base words in the Gita and sort them by frequency.
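A minimal sketch of the frequency count, using only the standard library. It counts surface forms as they appear in the text, not true base words: reducing each form to its base word would need one of the segmenters or morphological analyzers listed above. The function name `word_frequencies` is illustrative.

```python
import re
from collections import Counter

# Split on whitespace, danda (U+0964), double danda (U+0965),
# Devanagari and ASCII digits, and a few punctuation marks.
_SEPARATORS = re.compile(r"[\s\u0964\u0965\u0966-\u096F0-9.|-]+")


def word_frequencies(verses):
    """Return (word, count) pairs for all verses, most frequent first."""
    counts = Counter()
    for verse in verses:
        counts.update(token for token in _SEPARATORS.split(verse) if token)
    return counts.most_common()


if __name__ == "__main__":
    sample = [
        "धर्मक्षेत्रे कुरुक्षेत्रे समवेता युयुत्सवः ।",
        "मामकाः पाण्डवाश्चैव किमकुर्वत सञ्जय ॥ १-१ ॥",
    ]
    for word, count in word_frequencies(sample):
        print(count, word)
```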
Get the word-by-word, sentence-by-sentence, and paragraph-by-paragraph meanings.
Define the format for jumping to any chapter or verse: 3:11 should refer to chapter 3, verse 11, and so on.
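One way the `chapter:verse` format could be parsed, as a sketch. The names `VerseRef` and `parse_ref` are made up for illustration. Only the chapter range (1 to 18) is validated; per-chapter verse counts are left out because they differ slightly between editions.

```python
import re
from typing import NamedTuple


class VerseRef(NamedTuple):
    chapter: int
    verse: int


_REF = re.compile(r"^\s*(\d+)\s*:\s*(\d+)\s*$")
CHAPTER_COUNT = 18  # the Gita has 18 adhyayas


def parse_ref(text):
    """Parse a reference such as '3:11' into VerseRef(chapter=3, verse=11)."""
    match = _REF.match(text)
    if match is None:
        raise ValueError(f"expected 'chapter:verse', got {text!r}")
    chapter, verse = int(match.group(1)), int(match.group(2))
    if not 1 <= chapter <= CHAPTER_COUNT:
        raise ValueError(f"chapter must be between 1 and {CHAPTER_COUNT}, got {chapter}")
    if verse < 1:
        raise ValueError(f"verse must be at least 1, got {verse}")
    return VerseRef(chapter, verse)


if __name__ == "__main__":
    ref = parse_ref("3:11")
    print(ref.chapter, ref.verse)
```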
Use Elasticsearch so that we can search through all of these.
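A rough sketch of what the indexing and search requests might look like, built as plain JSON with the standard library so it runs without a cluster. The index name `gita_verses` and the field names `sanskrit` and `translation` are assumptions, not a settled schema. The bulk payload follows the newline-delimited format of the Elasticsearch bulk API (an action line followed by a document line), and the query uses `multi_match` from the query DSL.

```python
import json

INDEX = "gita_verses"  # hypothetical index name


def bulk_payload(verses):
    """Build an NDJSON body for the bulk API: one action line plus one document line per verse."""
    lines = []
    for verse in verses:
        # Use the chapter:verse reference as the document id, e.g. "3:11".
        doc_id = f"{verse['chapter']}:{verse['verse']}"
        lines.append(json.dumps({"index": {"_index": INDEX, "_id": doc_id}}))
        lines.append(json.dumps(verse, ensure_ascii=False))
    # The bulk format requires a trailing newline.
    return "\n".join(lines) + "\n"


def search_body(text):
    """Build a query that searches both the Sanskrit text and the translation."""
    return {
        "query": {
            "multi_match": {"query": text, "fields": ["sanskrit", "translation"]}
        }
    }


if __name__ == "__main__":
    verses = [
        {"chapter": 3, "verse": 11, "sanskrit": "...", "translation": "..."},
    ]
    print(bulk_payload(verses), end="")
    print(json.dumps(search_body("duty"), indent=2))
```

The payload would be sent to the `_bulk` endpoint and the query to `gita_verses/_search`, either over HTTP or through an Elasticsearch client.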
Build a simple app using React Native.
Resources:
- https://nbviewer.org/url/anoopkunchukuttan.github.io/indic_nlp_library/doc/indic_nlp_examples.ipynb
- In notebooks you can run pip install, and you can even clone dependencies from GitHub.