Turing Complete Transformers: Two Transformers Are More Powerful Than One
— Burny — Effective Omni (@burny_tech) January 7, 2024
"We prove transformers are not Turing complete, propose a new architecture that is Turing complete, and empirically demonstrate that the new architecture can generalize more effectively than transformers."… pic.twitter.com/LGVlZt0afu
The idea is to combine two small transformers rather than using one large model.
Would each be more specialised for its given task, and the combination provably Turing-complete?