I made a programming language to test how creative LLMs really are
Not because I needed to. Not because it’s efficient. But because current benchmarks feel like they were built to make models look smart, not prove they are. So I wrote… Weiterlesen »I made a programming language to test how creative LLMs really are