Tag: autoregressive transformers

  • Hacker News: Trap – Transformers in APL

    Source URL: https://github.com/BobMcDear/trap Source: Hacker News Title: Trap – Transformers in APL Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses an implementation of autoregressive transformers in APL, specifically focused on GPT2, highlighting its unique approach to handling performance and simplicity in deep learning. It offers insights that are particularly relevant to…

  • Simon Willison’s Weblog: Quoting Andrej Karpathy

    Source URL: https://simonwillison.net/2024/Sep/14/andrej-karpathy/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Andrej Karpathy Feedly Summary: It’s a bit sad and confusing that LLMs (“Large Language Models") have little to do with language; It’s just historical. They are highly general purpose technology for statistical modeling of token streams. A better name would be Autoregressive Transformers or something. They…