• kablammy@sh.itjust.works
      9 months ago

      It would have been more obviously gradient descent if they didn’t start with 0, so that the first gradient wasn’t the same as the second answer. I thought they were just repeating the last correct answer.