• bitfucker@programming.dev
      link
      fedilink
      English
      arrow-up
      1
      ·
      4 months ago

      So does OSM data. Everyone can download the whole earth but to serve it and provide routing/path planning at scale takes a whole other skill and resources. It’s a good thing that they are willing to open source their model in the first place.

    • chiisana@lemmy.chiisana.net
      link
      fedilink
      English
      arrow-up
      0
      ·
      4 months ago

      What’s the resources requirements for the 405B model? I did some digging but couldn’t find any documentation during my cursory search.

      • modeler@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        4 months ago

        Typically you need about 1GB graphics RAM for each billion parameters (i.e. one byte per parameter). This is a 405B parameter model. Ouch.

        Edit: you can try quantizing it. This reduces the amount of memory required per parameter to 4 bits, 2 bits or even 1 bit. As you reduce the size, the performance of the model can suffer. So in the extreme case you might be able to run this in under 64GB of graphics RAM.

        • cheddar@programming.dev
          link
          fedilink
          English
          arrow-up
          2
          ·
          4 months ago

          Typically you need about 1GB graphics RAM for each billion parameters (i.e. one byte per parameter). This is a 405B parameter model.