I donāt see this as scrutinizing a good-faith effort so much as trying to understand the principles being applied.
I agree that there has been a good-faith effort to draw lines around AI use, and I donāt think anyone is being willfully ignorant about the complexity involved.
My confusion is that the public discussion often combines two different arguments.
One argument is that AI-generated translations, explanations, and similar textual outputs should not replace human understanding and judgment. That seems like a fairly clear principle.
The other argument is a broader moral critique of the AI industry, the companies involved, and the direction the technology is taking.
Those lead to different conclusions.
If the concern is primarily about machine-generated Buddhist content, then many other uses of AI may be acceptable. If the concern is broader than that, then the boundaries become much less obvious.
Thatās why I brought up DPD. Not because I think anyone has done something wrong, but because it illustrates how difficult it can be to tell where the line actually is.
For example, would using an LLM to classify suttas by topic be acceptable? Would AI-assisted metadata generation be acceptable? Would AI-assisted identification of parallels be acceptable? Would AI-assisted search be acceptable? What if that search was natural language with citations?
I genuinely donāt know the answers, and I think those questions become increasingly relevant as more Buddhist software projects begin using these tools.
More importantly, if SuttaCentral is going to make a public request that its content not be used in certain ways, then I think there should be a reasonably clear explanation of where those boundaries are.
The data is intentionally placed in the public domain, so developers are free to build with it. At the same time, there is an expressed wish that some uses are inconsistent with the values or intentions behind the project. Thatās perfectly reasonable. But if those boundaries are not clearly defined, then someone could make a sincere, good-faith effort to build something they believe is aligned with those wishes, only to later discover that others in the community view it as crossing a line.
That seems unfair both to developers trying to act in good faith and to the community trying to communicate its expectations.
So Iām less interested in whether a particular project passes or fails the test than I am in understanding what the test actually is.
@sujato
Since the original AI essays and policy discussions were written by you, I was hoping you could clarify how you think about some boundary cases.
Iām less interested in whether any particular project is approved and more interested in understanding the principles.
Could you clarify the following?
-
Where do you draw the line between AI-generated Buddhist content and AI-assisted tooling such as classification, metadata generation, search, discovery, or corpus organization?
-
If a project never generates translations, commentary, doctrinal explanations, or chatbot-style responses, but does use an LLM to help classify, organize, tag, or search SuttaCentral data, would that be consistent with the principles you are trying to uphold?
-
If SuttaCentral data is provided to an AI system such as Claude, Grok, or ChatGPT as context for a project, but the output is limited to search, retrieval, categorization, or other non-generative functions, is that something you would consider acceptable, or does the concern begin at the point where the data is being processed by the model?
-
Is the primary concern the generation of Buddhist content by AI, or does the concern extend to non-generative uses of AI that help organize, classify, or retrieve information?
Iām asking because the data is CC0 and developers may want to respect the spirit of your wishes while reaching different conclusions about where the boundaries lie. It would be helpful to understand what principles determine those boundaries.
Thank you.