NPUs seem to be aimed at running tiny ML models at very low power, not at running large AI models.