Perhaps in the future such techniques may allow scholars to restore or read ancient Buddhist scriptures? Wondering what people on the forum would think of such a thing if it did come to pass?
I wonder whether the current forum rules would even allow printing any such scriptures if AI were involved in piecing them back together
UPDATE: Here is another article which goes into a bit more detail about the āAIā portion that helped decipher these texts.
Farritor spent much of the past year developing and training a machine-learning model that could detect ultra-faint differences in the texture of the carbonized scrolls, which are now too delicate to unroll. Those textural differences hinted at the presence of ink ā and Greek letters that many thought would never be read again. Eventually, Farritorās model managed to identify 10 letters in close proximity, enough to earn him the Vesuvius Challengeās First Letters Prize. Experts would soon conclude that several of those letters spelled the Greek word for āpurple.ā
This article on arxiv [2102.05095] Is Space-Time Attention All You Need for Video Understanding? describes in precise detail the AI that is being used to read these scrolls. It is a variant of a Transformer based model - the same kind of AI architecture that powers OpenAIās ChatGPT - so this would seem to be āreal AIā as opposed to the fuzzy kind
I guess that would depend on whether the AI is able to explain its reasoning steps in a manner that human experts could verify them? Probably similar to how human experts are verified? Iām just speculating though and I guess youād have to read the paper to understand how it is verified.
Btw, Iāve had no problem finding plenty of human gibberish when it comes to philosophy either
Just a thought experiment: Imagine that a new found manuscript contained the sentence in Pali: āThe Buddha affirmed everyone the existence of a Soul.ā
Invevitably, by changing just two or three Pali characters, the sentence reads differently and is not at odds with 95% of the existing material in which the Buddha does not affirm the existence of a Soul.
How likely do you think it is that AI will come up with the correct reconstruction?
There is gibberish ā¦ and then there is gibberish.
Wait a minute. So now if someone combines an AutoCAD application with an OCR application, itās āAIā? The term isnāt even meaningful anymore. Anything that processes any kind of data is AI now. Reality check: Thatās ordinary software people have been using for decades. The X-ray photography is where the magic occurs, as thatās how the writing is made visible.
Hereās an older story about the same process back in 2015:
Notice how the buzzword āAIā isnāt mentioned?
Hello cdpatton, while I share your amusement about the ever widening embrace of the word AI in novel ways, in this case I think it might not be applicable.
It does seem that an actual AI was trained - a machine learning model - to improve on the techniques that you note have been in use for a long time.
Here is one of articles I found with a bit more detail:
Farritor spent much of the past year developing and training a machine-learning model that could detect ultra-faint differences in the texture of the carbonized scrolls, which are now too delicate to unroll.
You can find a nice video within where the person who came up with the āAIā portion talks about what he did and how the it made it possible to deciphe a portion of these texts.
Technically OCR could be āmachine learningā.
What exactly is the machine learning, with regards to these scrolls? Surely it has a familiar character set and there is no quirky human change factors.
This is using the popular pytorch framework to train a machine learning model based on the resnet architecture. It is closely analagous to the CLIP (Contrastive LanguageāImage Pre-training) model from OpenAI you can read more about here.
Perhaps such models are not āAIā for purposes of the community guidelines then and weāre able to share results from such models on the forums? Can moderators clarify if this is allowed since āAIā is such a fuzzy term? Thanks!
I donāt understand this stuff well enough to want to form or state any confident opinions, but maybe itās a good starting point for someone elseās further research into how accurate these methods are.
Hi @Adutiya, the original article doesnāt go into the AI very much but if you read the other links that have been posted by myself in the UPDATE and by @Viktoria in her comment you can get more info about the AIās being developed for this.
You can see transformer architectures (what ChatGPT is made out of) as well as ResNet architectures (which OpenAIās CLIP basically is) given so you can see that real AI (as opposed the fuzzy fake kind Charles was referring to ) is actually being used to piece together these scrolls
Which goes into how they are using a transformer architecture as part of the āAIā in this work which is exactly the architecture that ChatGPT from OpenAI is made out of. Iām pretty sure that ChatGPT is considered āreal AIā for purposes of this forum as a lot of ire has been directed against the transformer architecture or the technologies that have come out of it. It is what powers nearly all the tech that is currently in the news re: AI.
From the paper:
We present a convolution-free approach to video classification built exclusively on self-attention over space and time. Our method, named āTimeSformer,ā adapts the standard Transformer architecture to video by enabling spatiotemporal feature learning directly from a sequence of frame-level patches.
Thatās kinda cool - I named my daughter Tyrian - which is the shade of Purple once only worn by royalty (due to the expense of harvesting the sea snails used for it) and the dye for it was first made in Ancient Greece.
Iād love to know if the actual word deciphered is my daughters name.
ā¦ and No, it had nothing to do with āGame of Thronesā ( I get asked that all the time lol) I hadnāt even watched it. It was a nod to the highest academically credentialed Spiritualist on the planet who was the chairman of the education committee and first ever āfellowā of the SNU UK (Spiritualist National Union), member of Mensa, wrote the courses Spiritualist world wide study for their credentials, ā¦ and who took me under her wing and spent years educating me to understand certain little known weird abilities. Her name was Violet ā¦ but I chose a different shade of purple
The term āmachine learningā is very broad and would include things as simple as calculating the MSE of a data set:
For the burnt scrolls above, I believe PyTorch was mentioned? PyTorch has a wide range of applications. If you had enough burnt scrolls at hand and knew their actual contents you could theoretically train a model in this way. I just made that up of course, Iām not sure how the model above was trained.
Yes, I see that now. It sounds like they are using machine-learning tool to automate the OCR process, like astronomers sorting images of galaxies with machine-learning software. Which is cool; it saves everyone a lot of time if it works well enough.
I think I was initially reacting to the claim that it was āthe first timeā it had been done, and I remembered this being in the news years ago. I guess what they meant was it was the first time that particular scroll had been read with a machine-learning tool. Okay, then.
The Smithsonian has a good article that goes into detail about how this process was invented and the science behind it. Itās a great example of thinking āoutside of the boxā to solve a problem that seems impossible using existing technologies in new ways:
It makes you wonder what other seemingly impossible things are possible if someone were to realize how to reapply existing technology. That creative spark that sets the wheel in motion is a mysterious thing; sometimes itās a response to some happenstance that makes a person think, āWait, what if I did this ā¦ā and one thing leads to another.
And since itās such a small world (as they say), this young fellow featured in the video ā Mr. Farritor ā was a SpaceX intern last year. He is now one of Mr. Muskās carefully selected āyoung engineersā who is deciding how to revolutionize the US federal government with AI. This includes deciding how many current employees can be tossed to the street.
Last year, in a University of Nebraska press release video, he talked about calling his mom when he heard the news that heād been selected as a winner for decoding one of the words on the scroll with AI.
Any 22-year-old would want to call their mom after achieving that. But I doubt she understands what heās being tasked with now.
Yes, I actually became aware of his work decoding the Vesuvius scrolls by first reading of his association with DOGE. I sincerely hope he returns to work on the scrolls asap.