Recent comments in /f/MachineLearning

impossiblefork t1_je5l8k9 wrote

How would either an architecture or a model be copyrightable?

Architectures are algorithms. If they aren't patentable and are in addition to that patented, they have no protection.

Model weights are a result of a mechanical procedure that fits a model to data, minimising some kind of error. That is not a work of human authorship.

Things that could be copyrightable are an article describing a model architecture, or a specific software implementation of a model.

As an argument why model weights are unlikely to be copyrightable consider the following parallel: we know that model output, for example, a story generated by ChatGTP based on a prompt is certainly not copyrightable, since it's not a work of human authorship, but then, how is the model? We can view the selection of training examples as something similar to a prompt and the training process as similar to the inference. I think giving copyright protection to model weights might be reasonable though, but I think it's unlikely that they have copyright protection.

3

thedamian t1_je5eweg wrote

Before answering the question, I would submit that you should be thinking of keeping your models behind an api. No need to have it sitting on the client side (which is why it feels you're asking the quesiton)

And behind an API it can be as big as you'd like or can afford on your server)

2

OldManSaluki t1_je5cmjw wrote

I'm leaning in this direction myself.

IANAL, but I think about the Feist Publications ruling which dealt with raw listings of facts (white pages names, addresses, phone numbers organized in the most functional format - alphabetic.) SCOTUS ruled that the raw data was not copyrightable even though it took a lot of effort to collect and compile it. It seems to be that the raw data here are the weights which would make them not copyrightable. The structural design of the model might be, and more than likely the compiled model with weights would be copyrightable.

I suspect this will work its way through the courts just in time to be rendered moot.

3