Makes sense. AAVE is mostly a spoken thing, LLMs are mostly trained on the corpus of written text on the internet and in books. It's pretty rare for people to write in an AAVE style in those contexts.
Except it has no difficulty reading and understanding AAVE, because people use it online frequently...
Like, the article makes that abundantly clear, but everyone commenting just read the headline and assumed what it meant was it couldn't understand it...
Well, if the training data is largely standard english, AAVE could look like less educated English, because it doesn't follow the normal rules and conventions. And there's probably a higher correlation between AAVE use and lower means and/or education because people from the black community who have higher means and/or education probably use standard English more often because that's how they're trained.
So I don't think this is evidence about the model being "racist" or anything of that nature, it's just the model doing model things. If you type in AAVE, chances are higher that you fit the given demographic, because that's likely what the training data shows.
So, I guess don't really see the issue here? This just sounds like people thinking the model does more than it does. The model merely matches input text to data in the model. That's it. There's no "understanding" here, it's just matching inputs to outputs.