On segmenting SA


I just discovered that there is in fact a proper HTML text of the FGS edition. Since I don’t have flash, when I go say here it gives me this error:

Sorry,need flash player. Get Adobe Flash Player it’s possible to view a simplified version of the book on any device, or you can view the mobile version here

Both of those links are broken, but it does suggest that an HTML version exists. So I tried using the search, and I found that that way I could get results in HTML. It appears as a popup <div>, charmingly styled to look like a window in XP! There’s no URL for the search result, so it’s still not particularly accessible, but at least we know there’s something there.


I’m maintaining the master XML of Yinshun Agama, but it is not publicly available.
right now it is in TEI P5 format but the apparatus is inline, like this: (Taisho SA sutra 203)

<num type=“雜含會編經號” n="254">二五四</num><num type=“大正藏雜含經號” n=“203”>(二〇三)</num></p>
<p>如是我聞:一時,佛住<choice><sic>毗</sic><corr resp="厚觀法師 周怡曄">毘</corr></choice>舍離耆婆拘摩羅藥師菴羅園。爾時,世尊告諸比丘:「若有比丘能

i’m thinking of converting to <anchor> with apparatus section like CBETA .


I have the text converted from Foguang Tripitaka CDROM, I can share with you for non commercial use.


Thanks for this, and apologies for my late reply.

I wonder about what the copyright status is of this type of text, though? Are ancient texts not automatically in the public domain?

For example, some organizations try to lay claims on the Taisho Tripitaka, or even CBETA. But all new canons are simply based on public domain texts from older canons, with some punctuation and spacing added or revised…?


Taisho and FGS Tripitaka claim for plate rights (layout, page number, etc).
which means reprint is prohibited without permission.
According to Taiwan copyright law, plate rights for ancient text (author passed away for more than 50 years) is 10 years.

you can get from , I only keep the core agama text, preface and appendix are removed. Please fork and watch it if you want to build anything on top of it as I am going to change it in near future.


Now that I have the Yinshun version on hand (which I got from here), this might actually be possible in the future. I’ll be cross-referencing the Yinshin and Taisho versions to perhaps make it possible to view either on Dharma Pearls. Even if I don’t go that far, I’ll need to do it just for my own reference, so I’ll bake it into my data files.


Excellent, I’m glad you found this discussion.

Ooh, there’s lots of good stuff on Yap’s repo, I’ll have to check it out.


This html version is also useful in a pinch. The Taisho sutra numbers are the one’s in parentheses, and the abbreviated sutras that the Taisho skipped are numbered, which is how Yinshun ended up with over 10,000. I’ll be implementing a numbering system like SA1.1 etc on my site with these other numbering systems in the data files for reference.


According to Kokuyaku Issaikyoo, the number of SA discourses is 13444 (see Choong Mun-keat, Fundamental Teachings of Early Buddhism, p. 17, n.2).


It probably depends on how exactly you interpret some of the abbreviated sutra descriptions. It’s not like the Pali, rather there are just comments indicating the extra texts (E.g. “Like this sutra above, substitute x, y, z for a.”). It can be less than obvious how many total sutras you end up with sometimes.


I think what I’ll do is input the different numbering systems into the Excel sheet that I had already created to track Pali parallels to the Taisho text. Once I have it all laid out with Yinshun’s 51 samyuktas, I’ll upload it somewhere and post a link.


If anyone is curious about how we end up with 13,000+ sutras, here it is: SA.174-186 have between a couple thousand and a couple hundred variants each. It was a little humorous. I was going along, 2/3 done with the data entry, and it was still only in the 1800s for Yinshun’s count. Then I hit 174, which has almost 3,000 variants by itself. I took Yinshun’s word for it: I think there’s a several different passage variations in different combinations.



I wonder if the extra numbers as compared to the Pali signifies lateness, or whether if you looked closer we might find the Pali can be expanded just as much; it wouldn’t take a lot, a few extra rounds of permutations.


That’s a good question. It’s an anomaly compared to the rest of the Samyukta. I think the most variants any other sutra had was maybe 100, most were between 1 and 50.

It reminds of the time I sat down and surveyed the Satasahasrika Prajnaparamita Sutra a couple years ago. It’s the largest expansion of the Astasahasrika. There’s one chapter in Xuanzang’s translation that goes on for 100 fascicles that’s like that: Just a long set of endless permutations listed out of the same formula. That one chapter is 1/4 of the whole thing.

I’m going to clean up the spreadsheet and see if I can’t finish adding some of the parallels to it, but I’ll post a link to it soon. With the data filters, you can easily switch between the Taisho and Yinshun order and use it to present the reconstruction on SuttaCentral.