yes, you can match on user agent, and then conditionally serve them other stuff (most webservers are fine with this). nepenthes and iocaine are the current preferred/recommended servers to serve them bot mazes
the thing is that the crawlers will also lie (openai definitely doesn't publish all its own source IPs, I've verified this myself), and will attempt a number of workarounds (like using residential proxies too)
I hadn’t encountered either the Howard person nor heard of this podcast, but imma find that episode and listen because it sounds like quite an experience!
many of the proponents of things in this field will propose/argue $x thing to be massively valuable for $x
thing is, that doesn't often work out
yes, there's some value in the tech for translation outcomes. to anyone even mildly online, "so are language teaching apps/sites using this?" is probably a very nearby question. and rightly so!
and then when you go digging into how that's going in practice, wow fuck damn doesn't that Glorious AI Future sheen just fall right off...
your posts keep just slinging words together and it’s just fucking weird