Hi PyMC community. I’d like to build a chatbot specifically to help with PyMC/Bayesian questions within my employer’s ecosystem. If I need to feed it internal data or proprietary information, I can’t use any publicly available LLM. Is this OK? Does anyone know if I need some sort of license or permission to use the code base for this private, internal bot?
What are you asking for exactly? The PyMC code base has a permissive license that should allow you to feed all of it into your system: pymc/LICENSE at main · pymc-devs/pymc · GitHub
Not sure what the status of the Discourse forum is, but it’s all in the open, so I’m pretty sure you can also scrape it, although Discourse may have some limitations in place.
See: Terms of Service - PyMC Discourse
Short answer from the link is that the posters maintain copyright to their posts, but license them to the site under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
I’m not an expert on CC licenses, but the NonCommercial aspect of this may be problematic for use inside a company. ShareAlike usually only kicks in when you try to distribute the material.
CC is generally considered a suboptimal license under which to distribute code, but I’m not entirely sure why. Here’s CC’s own take: