summary
Generative artificial intelligence has brought about three major copyright disputes and urgently requires a legal response. Firstly, regarding the copyright dispute of content generated by artificial intelligence, the concept of "people-oriented" should be adhered to, and "creation" should be understood as the act of directly determining expressive elements based on free will. Therefore, content generated by artificial intelligence should not be protected as a work. Secondly, regarding the infringement disputes generated by artificial intelligence, the responsibilities of all parties should be clarified from the perspective of unified rights and responsibilities, and the "assumed natural person testing method" can be used as the standard for confirming rights and responsibilities. Thirdly, regarding the controversy over artificial intelligence training, the basic theory of copyright law is difficult to provide a legitimate basis for the unauthorized establishment of training corpora that can accommodate massive works. We should base ourselves on the reality of China and promote innovation in legal protection.
Keywords: AI generated content, copyrightability, AI generated infringement, AI training
Wang Qian | Professor at East China University of Political Science and Law
1. Introduction
With the rapid development of generative artificial intelligence, related copyright legal issues are becoming increasingly prominent. The most prominent controversies currently include at least three aspects: firstly, whether the content generated by artificial intelligence based on prompt words can constitute a work in the sense of copyright law; Secondly, how to determine infringement and allocate liability when the content generated by artificial intelligence is identical or substantially similar to the prior work; Thirdly, training artificial intelligence with massive works, especially by crawling and collecting works and establishing a corpus that can be repeatedly called, whether it falls under exclusive rights control such as replication rights, and whether the current fair use rules can accommodate such behavior.
The recently released "Outline of the 15th Five Year Plan for National Economic and Social Development of the People's Republic of China" has clearly proposed to "explore the establishment of rules for the ownership of artificial intelligence generated products and the recognition of rights and responsibilities of developers, operators, and users" and "establish a system for the rational use of artificial intelligence training data". What is currently most needed is not to avoid conflicts conceptually, but to face them squarely within the framework of legal protection and solve the copyright issues of generative artificial intelligence through legal means. Copyright law should not only protect the rights and interests of authors, encourage creation and dissemination, but also reserve public domain and reasonable utilization space to avoid excessive suppression of social knowledge circulation caused by the expansion of rights. Therefore, it is necessary to explore a generative artificial intelligence copyright legal protection scheme that conforms to the concept of balancing interests. To resolve the controversy over whether content generated by artificial intelligence belongs to works, it is necessary to clarify the connotation of "creative behavior"; Resolving disputes over artificial intelligence generated infringing content requires clear rules for determining infringement and assuming responsibility; The problem of artificial intelligence training requires rebuilding a predictable institutional balance between innovation incentives and rights protection.
2、 Qualitative Dispute Analysis of Artificial Intelligence Generated Content under the Concept of Putting People First
The Global Initiative for Artificial Intelligence Governance proposes that the development of artificial intelligence should adhere to the concept of "people-oriented". This viewpoint is not only in line with the Marxist advocacy of human liberation and opposition to human alienation, but also in line with the humanistic concept in traditional Chinese culture. The theoretical system and specific system of copyright in the era of artificial intelligence should always be centered on "people". This article argues that "creation" is a behavior based on free will that directly determines expressive elements, which reflects the concept of "people-oriented".
The first paragraph of Article 3 of the Implementation Regulations of the Copyright Law of the People's Republic of China (hereinafter referred to as the "Implementation Regulations of the Copyright Law") stipulates that "creation referred to in the Copyright Law refers to intellectual activities that directly produce literary, artistic, and scientific works." The second paragraph of Article 3 of the Implementation Regulations of the Copyright Law stipulates that "organizing work for others to create, providing consulting opinions, material conditions, or carrying out other auxiliary work shall not be considered as creation." Among them, "directly producing... works" emphasizes the free will of civil subjects to determine the expressive elements required to constitute works. The connection between it and the work is so close that it can only be described as' direct 'rather than' indirect '. Correspondingly, "indirectly producing... works" includes giving instructions, guidance, opinions, or suggestions on the creative concept, style, goals, and methods of the work. It cannot directly determine the expression that constitutes the work, but requires the author to choose and judge on their own, and implement it in a personalized way. A typical example is that a teacher who is required to draw and write hundreds of words for a student cannot become the author of the student's artwork themselves.
To determine whether the developer or user of artificial intelligence has "created" relevant content using artificial intelligence as a tool, it is necessary to determine whether the developer or user directly determined the expressive elements that constitute the relevant content based on their free will. The most direct external manifestation of this standard is that "direct decision" means that many different creative tools of the same kind, when dealing with the same requirement, obtain the same expressive elements, which can only produce technical differences. For example, using various computer word processing programs, such as Microsoft Word from the United States and WPS from Kingsoft Corporation in China, to perform the same text input on several computers with significantly different configurations, the result is inevitably the generation of the same text combination, word choice, and sentence construction. For AI generated content, AI developers naturally need to design algorithms and models, and use a large amount of data to "train" AI, repeatedly correcting errors, so that AI can autonomously generate content for different needs. However, artificial intelligence developers cannot directly decide the content generated by artificial intelligence based on their free will through the above operations alone. Similarly, users who issue instructions to artificial intelligence to complete specific tasks cannot rely on their free will to determine the content generated by the AI, and therefore cannot be considered to have engaged in the "creation" of the work. Although user instructions define the direction and field of content generated by artificial intelligence, and satisfactory parts can be selected from the numerous contents generated by artificial intelligence, they cannot directly determine the expressive elements of this part of content.
The skeptical view on this is that if the prompt words are specific to the function level, such as inputting the coordinates of a painting created by the user to the artificial intelligence, it can recognize the user's original contribution. However, this situation means that artificial intelligence has become a technological tool for replicating user artwork, rather than autonomously "generating" content, which contradicts the name "generative artificial intelligence" and the logical starting point for discussing whether generated content belongs to a work. Of course, if technology really develops to the point where artificial intelligence can reach the technological stage described in the science fiction movie "Inception" (the so-called brain computer interface technology is probably developing in this direction), when the painter has already conceived all the expressive elements of the work in their mind, that is, the so-called "bamboo in their heart", artificial intelligence infiltrates into human consciousness and turns the "bamboo in their heart" into a bamboo image on the computer screen, then indeed "the painter's foresight for the original expression of the final picture may be close to 100%". However, is the artificial intelligence under this assumption still the Generative Artificial Intelligence we are discussing now? I'm afraid changing it to 'Reprographic Artificial Intelligence' would be more appropriate!
This article argues that content generated by artificial intelligence should be excluded from the scope of the work. As for the intellectual contributions that users may make in the process of generating content through artificial intelligence (such as creatively modifying, assembling, or "deducing" the content generated by artificial intelligence, or designing prompt words that can form a written work), including them in the scope of copyright protection like other works has never been controversial. The question of whether to establish adjacency rights for AI generated content, that is, to provide special protection, is certainly worth taking seriously. The right of adjacency does not require the creation of original content, but like the right of authorship, it is an absolute right and a right to the world in civil law. Whether to create such adjacency rights involves complex interests and balance of interests. There is no precedent internationally for setting adjacency rights (cultural achievement rights other than works) for content generated by artificial intelligence. This article does not presuppose any viewpoint, but believes that at least extensive research should be conducted to actively track the application practice of artificial intelligence generated content, examine the development trend of artificial intelligence technology and the needs and expectations of the industry. Only in this way can we take root in China's reality and make institutional innovations that are in line with the national conditions.
3、 A review of infringement disputes generated by artificial intelligence from the perspective of unified rights and responsibilities
On April 25, 2025, General Secretary Xi Jinping pointed out during the 20th collective study session of the Political Bureau of the Communist Party of China Central Committee that the regulatory system and mechanism for artificial intelligence should be improved, and the initiative in the development and governance of artificial intelligence should be firmly grasped. The copyright infringement risks brought by artificial intelligence need to be effectively regulated and governed. When artificial intelligence "plagiarizes" a prior work and generates content that is essentially identical or substantially similar to the prior work, it will lead to complex issues of infringement determination and liability.
Copyright law tolerates coincidences, which means that both parties have the copyright to independently completed highly similar or even extreme identical content based on coincidences, and do not infringe on each other's copyright. However, both domestically and internationally, in copyright infringement lawsuits, there are few cases where the accused party can prove that their subsequent content is purely based on coincidence and substantially similar to the plaintiff's prior work. Similarly, the argument of coincidence is difficult to apply to scenarios where artificial intelligence generates infringement. The saying goes, 'A clever woman cannot cook without rice.' When the content generated by artificial intelligence is substantially similar to a prior work that is complex to a certain extent, a more reasonable explanation is that artificial intelligence uses some technical means to substantially reproduce the prior work used as training material, rather than purely creating content that is substantially similar to other people's works on its own. If many users request generative artificial intelligence to provide them with a painting of another person within the copyright protection period, and the generative artificial intelligence directly and stably generates the same or substantially similar content, then evaluating the situation where this training model generates similar results as a technical coincidence would be biased.
The perspective of unified rights and responsibilities helps to evaluate the responsibilities of all parties involved in disputes over infringement generated by artificial intelligence. The unity of rights and responsibilities implies the unity of rights and obligations, where "rights" are not limited to copyright or other legal rights, but generally refer to interests. At the same time as obtaining corresponding benefits, it is usually necessary to assume corresponding obligations. Artificial intelligence developers invest huge amounts of money in developing generative artificial intelligence in order to form corresponding business models, directly providing AI generation services or embedding corresponding products to provide AI generation services in exchange for economic benefits. This commercial interest is the driving force behind the development of artificial intelligence, and it is also the "right" (interest) in the "unity of rights and obligations". Correspondingly, when artificial intelligence developers obtain rights from it, they also need to assume corresponding obligations. If the artificial intelligence provided by the developer has serious defects and can easily and stably generate infringement results, the developer cannot escape responsibility and needs to take measures to stop infringement and bear compensation liability depending on the degree of fault.
The hypothetical natural person testing method provides a feasible solution for determining rights and obligations. This method refers to first assuming that artificial intelligence is a natural person, and assuming that the actual technological process implemented through artificial intelligence is a behavior with the same effect implemented by a natural person, in order to analogize and compare the seemingly difficult and controversial issues, and then replacing the assumed natural person with artificial intelligence in the conclusion (with the core difference that artificial intelligence has no civil subject qualification), in order to observe the rationality of the conclusion. There has always been controversy over whether users, AI service providers, or AI developers should be held responsible for the plagiarism of prior works by artificial intelligence. But if we compare generative artificial intelligence to a network jukebox, we can see that traditional copyright law has already established rules for resolving disputes. At this point, users are natural persons who request music from online jukeboxes, and artificial intelligence developers can be likened to the manufacturers and sellers of online jukeboxes. In the case where a network jukebox provides prior music works to others, users only provide instructions and enjoy them in their private space, without engaging in exclusive rights such as copying works. Storing music works on a computer for future enjoyment is considered reasonable use. As for whether the subsequent dissemination of the work by the user constitutes infringement, it shall be judged separately. At the same time, although artificial intelligence developers have not directly engaged in "plagiarism" behavior, just like the manufacturers and sellers of online jukeboxes who are responsible for infringement, they may be held responsible for the damages caused by AI "plagiarism" of prior works. In addition, the above methods can also be applied to preliminarily examine the responsibilities of artificial intelligence access and aggregation service providers, in order to clarify whether they apply the "notification and necessary measures" rule and whether they have fulfilled their reasonable duty of care and taken "necessary measures" to prevent infringement.
In short, to solve the problem of artificial intelligence generating infringing content, we still follow the general principles of copyright law and civil code infringement liability. Unifying rights and responsibilities to encourage artificial intelligence developers to increase investment and technological research and development, and prevent the generation of infringing content by artificial intelligence, is of great benefit to the healthy development of artificial intelligence. Technological development may advance rapidly, but the basic principles and operational logic of law can remain relatively stable for a considerable period of time, as new technological phenomena do not necessarily bring new problems that can overturn existing rules.
4、 A review of the controversy surrounding artificial intelligence training under policy guidance
On April 25, 2025, General Secretary Xi Jinping pointed out during the 20th collective study session of the Political Bureau of the Communist Party of China Central Committee that we must face up to the gap, double our efforts, and comprehensively promote innovation in artificial intelligence technology, industrial development, and empowering applications. The Global Initiative for Artificial Intelligence Governance states that the development of artificial intelligence should be aimed at enhancing the common well-being of humanity, with the premise of safeguarding social security and respecting human rights. The proper resolution of copyright issues related to artificial intelligence is not only related to legislation and judiciary, but also closely related to the artificial intelligence industry.
Most artificial intelligence developers need to establish internal knowledge databases (training corpora) containing massive works to train artificial intelligence, but this massive, systematic, and large-scale commercial use has not yet been included in the scope of rights restrictions stipulated by any international treaty or national copyright law. Article 7 of the Interim Measures for the Administration of Generative Artificial Intelligence Services jointly issued by the Cyberspace Administration of China and seven other departments stipulates that when conducting training, providers of generative artificial intelligence services shall not infringe upon the intellectual property rights enjoyed by others in accordance with the law, if it involves intellectual property rights. However, the regulation does not specify under what circumstances infringement can be recognized. How to qualitatively and standardly train artificial intelligence using works has sparked the most intense debate worldwide. The main reason for this is that the development of artificial intelligence technology is too rapid, and existing copyright laws and institutional design theories are not yet prepared.
The existing viewpoint often cites theories of non expressive or transformative use originating from the West as a basis for the legitimacy of artificial intelligence training. However, even Western cases indicate that large-scale internal replication that has a substantial impact on the subscription market does not constitute fair use. Judge Leval, the proposer of the concept of transformative use, believes that the defendant company's library's copying of journal articles is neither transformative nor non-commercial, but rather substitutive. The defendant company only mechanically copied the relevant articles. The copy of the original document is not intended to be used as a part of a larger entity for certain new purposes. The main purpose of the defendant company's copying is to replace the original and allow for copying, even in bulk. Researchers can make copies of an article for future reading or reference, eliminating the need to share the original journal with others. This type of copying behavior does not make any substantial additions or modifications to the original work, it only achieves an increase in the number of copies. The so-called 'transformative use' requires the creation of new content that is different from the original, rather than simply making copies for a beneficial purpose. The second instance also supports this idea, as the court found that the defendant company cannot obtain a reasonable use exemption just because it copied the archives of an article (or book). Similarly, a newspaper cannot claim that, as its business is "news reporting," it can self copy news books and place them on journalists' bookshelves for them to learn from, nor can a school claim that in order to help teachers improve their teaching abilities, it can make a large number of copies of educational books and have teachers read the copies. No matter how much benefit copying and reading such books may bring to improving teaching ability, it cannot constitute reasonable use for teaching purposes.
This article argues that artificial intelligence training is deeply embedded in the context of China's industrial upgrading, technological competition, and institutional adjustment, and therefore has a strong policy orientation. As pointed out in the "Outline for Building a Strong Intellectual Property Country (2021-2035)", intellectual property protection should "highlight key areas and major needs, and promote the deep integration and development of intellectual property with various aspects such as economy, technology, culture, and society". Against this backdrop, the "non expressive use," "transformative use," and "opt out" currently being discussed in the West are difficult to provide a permanent solution to the copyright issues arising from using works to train artificial intelligence. At different stages and periods of the application of artificial intelligence technology, there should be policies and institutional designs that are suitable for the social and economic conditions and national conditions at that time. The next more feasible direction is not to forcefully apply the problem of artificial intelligence training to these previously emerged concepts and mechanisms, and thus universally recognize that using works to train artificial intelligence is a reasonable use. On the basis of fully grasping China's actual national conditions, it is necessary to clarify the crux of the problem of artificial intelligence training, promote the improvement of relevant mechanisms, and safeguard the rights and interests of rights holders. In this process, some temporary institutional designs with transitional nature may be needed, and more explanatory Chinese discourse may be extracted from them to achieve a spiral rise in the legal protection of artificial intelligence training.
V. Conclusion
While maintaining the boundaries of copyright holders' rights, necessary and clear legal guarantees should also be provided for the development of new technologies. What is truly needed in the era of generative artificial intelligence is to provide clearer and more forward-looking rules to delineate predictable action boundaries for all parties involved. Only in this way can technological development not come at the cost of excessive erosion of rights, and rights protection can also obtain a sustained institutional foundation in response to changes in reality.
*The above article is from the 4th issue of China Rule of Law in 2026
Source: China Rule of Law Online