• lordnikon@lemmy.world
    link
    fedilink
    English
    arrow-up
    31
    arrow-down
    1
    ·
    3 days ago

    Didn’t I just read an article about Apple wanting to use it. It’s open source it can be forked and be American tomorrow. No matter how much Openai crys. This just shows the true lack of any knowledge on the subject. I’m not a fan of the Chinese government as much of the next guy. But the information is out there.

      • lordnikon@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 days ago

        I can’t confirm if this is correct or not but another example is you can use deepseek in aws bedrock right now. So it’s makes the same point.

    • prenatal_confusion@feddit.org
      link
      fedilink
      arrow-up
      3
      ·
      2 days ago

      Honest Questions Can a model be open source? Dont we need the data it was trained on to truly be able to reproduce, understand and modify it?

      • lordnikon@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        2 days ago

        No a model is a model now if you are talking a chat bot or ai app yeah you need the data and that data changes all the time but if I have the model I can supply my own and get simalr results to their model if the data I need is the same. The code is right here https://github.com/deepseek-ai/DeepSeek-V3?tab=readme-ov-file I can get the models from https://huggingface.co/deepseek-ai/DeepSeek-V3-Base and train them further and redistub that model legally. Per

        Section III: CONDITIONS OF USAGE DISTRIBUTION AND REDISTRIBUTION

        1. Distribution and Redistribution. You may host for Third Party remote access purposes (e.g. software-as-a-service), reproduce and distribute copies of the Model or Derivatives of the Model thereof in any medium, with or without modifications, provided that You meet the following conditions: a. Use-based restrictions as referenced in paragraph 5 MUST be included as an enforceable provision by You in any type of legal agreement (e.g. a license) governing the use and/or distribution of the Model or Derivatives of the Model, and You shall give notice to subsequent users You Distribute to, that the Model or Derivatives of the Model are subject to paragraph 5. This provision does not apply to the use of Complementary Material. b. You must give any Third Party recipients of the Model or Derivatives of the Model a copy of this License; c. You must cause any modified files to carry prominent notices stating that You changed the files; d. You must retain all copyright, patent, trademark, and attribution notices excluding those notices that do not pertain to any part of the Model, Derivatives of the Model. e. You may add Your own copyright statement to Your modifications and may provide additional or different license terms and conditions - respecting paragraph 4.a. – for use, reproduction, or Distribution of Your modifications, or for any such Derivatives of the Model as a whole, provided Your use, reproduction, and Distribution of the Model otherwise complies with the conditions stated in this License.

        Use-based restrictions. The restrictions set forth in Attachment A are considered Use-based restrictions. Therefore You cannot use the Model and the Derivatives of the Model for the specified restricted uses. You may use the Model subject to this License, including only for lawful purposes and in accordance with the License. Use may include creating any content with, finetuning, updating, running, training, evaluating and/or reparametrizing the Model. You shall require all of Your users who use the Model or a Derivative of the Model to comply with the terms of this paragraph (paragraph 5).

        • prenatal_confusion@feddit.org
          link
          fedilink
          arrow-up
          2
          ·
          1 day ago

          Okay let’s talk terminology

          I meant the useable application as a for example chat bot.

          The model itself is useful as far as it enables me to understand the underlying logic of the whole thing right? But without the data and I guess a training period with it I can’t use it if I understood it correctly.

          If I am right with the above this wouldnt make it useful as in I can reproduce it as a functioning application and I wouldn’t call that opensourve in my understanding of the word.

          • markinov@lemmygrad.ml
            link
            fedilink
            English
            arrow-up
            1
            ·
            14 hours ago

            I don’t know much as i ain’t knowledgeable in ML and LLM, but i downloaded deepseek R1 7B model (which probably is the weights and architecture) from huggingface and it worked with no “data”. So i can say it’s reproducible. Now does that make it open source?