Github is just the place where the texts are stored, it’s not an application. Or it is, but not that kind of application.
CST4 or DPR—or for that matter SC—offer an integrated package that facilitates certain kinds of search and analysis. These cover a lot of ordinary use cases and are fine for most people.
However, if you need some kind of specialized analysis not covered by these apps, one approach you could use is to clone the bilara-data repo locally. Then you can search or analyze it with any tool you like. Since it is pure and battle-tested JSON, it is easy to transform into any format, or just treat it as plain text.
For myself, my main tools are Sublime Text, which lets me do rich searching and regular expressions across the whole corpus, or a defined subset; and Libreoffice Calc, where I can import texts via bilara i/o and query or manipulate them in the various ways that a spreadsheet makes possible.
For example, i might want to search for cases where “dhamma” is translated as “thing”. Searches for “dhamma” or “thing” would be painful, but using bilara i/o I can do both at once.
(Incidentally, this is possible also in the Bilara webapp for translators, and we hope to bring to SC one day!)
More ambitiously, someone with some basic programming skills can use one of the tools mentioned below by Erik, to which I would add texthero;
More advanced still, neural nets offer new possibilities, as can be seen at Buddhanexus:
(Not to be a party-pooper, but neural nets in their current form have, in my view, over-promised and under-delivered, and we seem to be approaching the diminishing returns phase of their evolution. Still, they may yet make significant contributions to Buddhist studies.)
That’s cool. You can also do this using sublime text and regular expressions, but it’s more of a learning curve.