
Why is it art?

A wall painting at Lascaux cave

Since the cave paintings at Lascaux and similar locations were painted before concepts of composition were even thought of, are they art?

Some time ago, in a Facebook group, someone asked this good question.

My initial answer was as below.

I doubt if the people making them saw them as ‘art’. That’s imposing our view of the world on the makers. From our perspective, though, yes they are art. Plus, places like Lascaux were painted by numerous hands, perhaps over centuries. They are not single compositions. What they show though is considered mark making towards an idea.

I suspect the principles they were following were more in the line of magical thinking than aesthetics. They may have thought, “if I paint this animal being killed, we will be successful on our next hunt.” Another possibility is that it was an offering to the spirit world, giving thanks for a successful hunt. Both these have been found in modern times by anthropologists. Whatever it was, they seem to have had something guiding them. They were not just throwing pigment around at random.

Sadly, the discussion didn’t progress much. The rest of this post is based on the argument I was trying to make. It includes a few extra points that didn’t occur to me at the time.

I now think the original question was based on a false premise. We don’t know if they had ideas about composition. We do know that they are not random daubs on a wall. They represent a significant human achievement, only possible because of a great deal of effort and time. They meant something to their makers.

We can never know the motives of the makers or the guiding principles they were working under. We can only speculate. Our speculations cannot fail to be coloured by our own world view, as were those of the original questioner.

The question of what was in the minds of the original makers of these paintings is a different question to whether they are ‘art.’ We don’t even know if the makers had a concept of art as an endeavour in its own right. There seems to have been an urge to decorate, shown in other finds from many similar cultures, but we still don’t know why it was done.

If we shift our viewpoint from looking at cave paintings to looking at scientific illustrations, it is perhaps clearer. Hooke made incredibly detailed and brilliantly executed drawings of what he saw in his microscope. These are hugely valuable in terms of their scientific intent. To see them as art means looking at them from the perspective of a set of values different from those of the original maker. The one doesn’t negate the other. Both reference frameworks can apply simultaneously.

In the end, I suspect defining art is like defining a game. After all, what links tennis, golf, poker and Resident Evil? All games, but we would find it hard to describe the common characteristic. So in my view art includes Lascaux cave paintings, Neolithic rock carving, Medieval illuminated manuscripts, Rembrandt, Monet, Malevich, Hockney, Basquiat, and David Bailey.

Hooke is close enough in time for us to have some idea of his thought processes. I think we can feel reasonably confident that he had a sense of his work as having an aesthetic value beyond being ‘just’ a scientific illustration. We can’t know whether the creators of cave paintings had ideas or concepts of composition, or what they were thinking. We can be sure, though, that they were thinking…

This post is related to several others about meaning in art.

Neolithic art has also provided inspiration for many of my own prints.

Mixed media print - monotype and pastel
Rocking in Rhythm #1 mixed media pastel over monoprint
Collagraph print inspired by neolithic rock carvings
Hammer marks on Weardale

Art, AI and AI art – Part 2

An AI image of Aphrodite rising from the waves, after the original by Botticelli

Introduction to Part 2

This post began as a limited review of what has become known as AI Art. In order to do that, I had to delve deeper into AI in general. Consequently, the post has grown significantly and is now in four parts. Part 1 looked at AI in general. This post, Part 2, will look at more specific issues raised by AI art. Part 3 will look at the topic from the perspective of an artist. Finally, Part 4 will be a gallery of AI-generated images.

This isn’t a consumer review of the various AI art packages available. There are too many, and my budget doesn’t run to subscriptions to enough of them to make such a review meaningful. My main focus is on commonly raised issues such as copyright, or the supposed lack of creativity. I have drawn mainly on one AI app, Imagine AI, for which I took out a subscription. I tried a few others, using the free versions, but these are usually shackled by ads or limited to a narrow set of capabilities.

Links to each post will be here, once the series is complete.

How do AI art generators work?

What they do is take a string of text and, from that, generate pictures in various styles. How do they achieve that? The short answer is that I have no idea. So, I asked ChatGPT again! (Actually, I asked several times, in different ways.) I’ve edited the responses, so the text below is in my own words, using ChatGPT as a researcher.

In essence there are several steps, each capable of inserting bias into the output.

Data Collection and Preprocessing

The AI art generator is trained on a large dataset of existing images. This can include paintings, drawings, photographs, and more. Generally, each image is paired with a text that in some way describes what the image is about. The data can be noisy and contain inconsistencies, so a certain amount of preprocessing is required. The content of the dataset is critical to the images that can be produced. If it only has people in it, the model won’t be able to generate images of cats or buildings. If the distribution of ethnicities is skewed, so will be the eventual output.

Selection of Model Architecture

The ‘model’ is essentially the software that interprets the data and generates the images. There are numerous models in use, and the choice is critical: it determines the kind of images the AI art generator can produce. In practice, the model may have several components. A base model might be trained on a large database of images, while a fine-tuning model is used to direct the base model’s output towards a particular aesthetic.


Training the Model

During training, the AI model learns to capture the underlying patterns and features of the images in the dataset of artworks. How this is done depends on the model in use. It seems, however, that they all depend on a process of comparing randomly generated images with the dataset and refining the generated image to bring it as close as possible to the original.
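The ‘compare and refine’ idea can be sketched in a few lines of code. This is a deliberately crude toy, not how any real diffusion model works: the function names and numbers are mine, and an ‘image’ here is just a short list of pixel values. Still, it captures the essential loop of starting from random noise and repeatedly nudging it closer to a target from the dataset.

```python
import random

def refine_towards(target, steps=200, rate=0.1, seed=42):
    """Start from random noise and iteratively nudge it towards a target."""
    rng = random.Random(seed)
    image = [rng.random() for _ in target]   # start from pure noise
    for _ in range(steps):
        # move each "pixel" a small fraction of the way towards the target
        image = [p + rate * (t - p) for p, t in zip(image, target)]
    return image

def distance(a, b):
    """Euclidean distance: how far the generated image is from the target."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

target = [0.2, 0.8, 0.5, 0.1]          # a stand-in for a training image
result = refine_towards(target)
print(distance(result, target) < 0.01)  # True: the noise has converged
```

The real systems do something vastly more sophisticated in a learned feature space, but the point stands: the output is shaped by, and can only ever approach, what is in the training data.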

Generating Art

After training, the AI can be used to generate new art. This is a significant task in its own right. The app’s AI model needs to understand the semantics of the text and extract relevant information about the visual elements mentioned. It then combines the information extracted from the text prompt with its own learned knowledge of various visual features, such as objects, colours, textures, and more. This fusion of textual and visual features guides the model in generating an image that corresponds to the given prompt.

Fine-Tuning and Iteration

There is a skill in writing the text prompts. The writer needs to understand how the text to image element of the app works in practice. In use, therefore, there is often a need for fine-tuning. Artists may adjust the prompt or other parameters to achieve the results they have in mind. Feedback from this process may also help in development and refinement of the model.

Style Transfer and Mixing

Some AI art generators allow for style transfer or mixing. The AI will generate a new image based on the content of a specific piece, but in the style of another.
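The style-mixing idea can also be shown as a toy. In a real system, both ‘content’ and ‘style’ are vectors in a learned feature space; here they are just dictionary entries, and every name below is illustrative rather than part of any real API. The sketch keeps the content features of one image and swaps in the style features of another.

```python
def style_transfer(content_image, style_image):
    """Toy style transfer: keep what the picture shows, borrow how it is rendered."""
    return {
        "subject": content_image["subject"],    # the content stays
        "palette": style_image["palette"],      # the style is swapped in
        "brushwork": style_image["brushwork"],
    }

photo = {"subject": "covered bridge", "palette": "realistic", "brushwork": "none"}
monet = {"subject": "water lilies", "palette": "pastel", "brushwork": "impressionist"}

result = style_transfer(photo, monet)
print(result["subject"])   # covered bridge
print(result["palette"])   # pastel
```

The result is a covered bridge rendered with a pastel palette and impressionist brushwork: content from one source, style from another.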


Post-Processing

The generated image may then be subject to further post-processing to achieve specific effects or to edit out artefacts such as extra fingers.

Is it really intelligent?

Many of these apps describe themselves as AI ‘Art’ generators. That is, I think, a gross exaggeration of their capabilities. There is little ‘intelligence’ involved. The system does not know that a given picture shows a dog; it knows that a given image from the training data is associated with a word or phrase, say ‘dog’. It doesn’t know that dogs have four legs. Likewise, it doesn’t know anatomy at all. It may know that dog images tend to have the shapes we identify as legs, broadly speaking one at each corner, but it doesn’t know why, or how they work, or even which way they should face, except as a pattern.

Importance of training data

Indeed, in the unlikely event of a corruption in the training data, such as identifying every image of a dog as a sheep, and vice versa, the AI would still function perfectly, but the sheep would be herding the dogs. If the dataset did not include any pictures of dogs, it could not generate a picture of a dog.
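The dog-and-sheep point can be made concrete with a toy ‘model’ that is nothing but a lookup from label to learned pattern (a caricature of real training, with all names purely illustrative). Swap the labels in the training data and the machinery still runs perfectly; it simply produces the wrong animal. And if a label never appears in the data, the model has nothing to generate from.

```python
def train(pairs):
    """Build a trivial 'model': a mapping from label to example patterns."""
    model = {}
    for label, pattern in pairs:
        model.setdefault(label, []).append(pattern)
    return model

def generate(model, prompt):
    """'Generate' an image for a prompt -- impossible if the label was never seen."""
    if prompt not in model:
        raise ValueError(f"no training data for {prompt!r}")
    return model[prompt][0]

clean = [("dog", "four legs, wagging tail"), ("sheep", "four legs, woolly coat")]
corrupt = [("sheep", "four legs, wagging tail"), ("dog", "four legs, woolly coat")]

print(generate(train(clean), "dog"))     # four legs, wagging tail
print(generate(train(corrupt), "dog"))   # four legs, woolly coat -- a sheep
```

Asking the clean model for a ‘cat’ raises an error: no training data, no cat. The system never knew what a dog was in the first place, only which patterns travelled with which label.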

On top of that, if there is any scope for confusion in the text prompts, these programs will find it. To be fair, humans are not very good at understanding texts either, as five minutes on Twitter will demonstrate. Even so, I’m sure that art AI will get better, technically at least. It will even learn to count limbs and mouths.

Whatever we call it, we know real challenges are coming. Convincing ‘deep fake’ videos are already possible. I’m guessing that making them involves some human intervention at the end to smooth out the anomalies. That will change, at which point the film studios will start to invest.

We are still a long way from General AI though. An art AI can’t yet be redeployed on medical research, even if some of the pattern matching components are similar.

Is AI art theft?

These apps do not generate anything from nothing. They depend upon images created by third parties, which have been scraped from the web along with their associated text. It is often claimed that this dependency amounts to plagiarism or breach of copyright. There are several class-action lawsuits pending in the US arguing just that.


These claims include:

  • Violation of the sections of the Digital Millennium Copyright Act (DMCA) covering the stripping of copyright-related information from images
  • Direct copyright infringement by training the AI on the scraped images, and reproducing and distributing derivative works of those images
  • Vicarious copyright infringement for allowing users to create and sell fake works of well-known artists (essentially impersonating the artists)
  • Violation of the statutory and common law rights of publicity related to the ability to request art in the style of a specific artist

Misconceived claims

It is difficult to see how they can succeed, but once cases get to court, aberrant decisions are not exactly rare. For what it’s worth, though, my comments are below. (IANAL)

  • The argument about stripping images of copyright information seems to be based on an assumption that the images are retained. If no version of an image exists without the copyright data, how can it have been stripped?
  • The link between the original data and the images created using the AI seems extremely tenuous and ignores the role of the text prompts, which are in themselves original and subject to copyright protection.
  • A style cannot be copyrighted. The law does not protect an idea, only the expression of an idea. In prosaic terms, the idea of a vacuum cleaner cannot be copyrighted, but the design of a given machine can be. If a given user creates images in the style of a known artist, that is not, of itself, a breach of copyright. If they attempt to pass off that image as the work of that artist, it is dishonesty on the part of the user, not the AI company. This is no different to any other case of forgery. Suing the AI company is like suing the manufacturer of the inks used by a forger.
  • If style cannot be protected, how can it be a breach to ask for something in that style?

The ‘collage tool’ premise

Essentially, the claims seem to be based on the premise that the output is just a mash-up of the training data. They argue that the AI is basically a giant archive of compressed images from which, when given a text prompt, it “interpolates” or combines the images in its archive to provide its output. The complaint actually uses the term “collage tool” throughout. This sort of claim is, of course, widely expressed on the internet. It rests, though, in my view, on a flawed understanding of how these programs really work. For that reason, the claim that the outputs are in direct correlation with the inputs doesn’t stand. For example, see this comparison of the outputs from two different AIs given the same input data.

As the IP lawyer linked above suggests:

…it may well be argued that the “use” of any one image from the training data is de minimis and/or not substantial enough to call the output a derivative work of any one image. Of course, none of this even begins to touch on the user’s contribution to the output via the specificity of the text prompt. There is some sense in which it’s true that there is no Stable Diffusion without the training data, but there is equally some sense in which there is no Stable Diffusion without users pouring their own creative energy into its prompts.

In passing, I have never found a match for any image I have generated using these apps on Google’s reverse image search or TinEye. I haven’t checked every one, only a sample, but enough to suggest the ‘use’ of the original data is, indeed, de minimis, since it can’t actually be identified. Usually I would see lots of other AI-generated images. I suspect this says more about originality than any claims to the copying of styles.

I suppose, if an artist consistently uses a specific motif, such as Terence Cuneo’s mouse, it could be argued there was a copyright issue, but even then I can’t see such an argument getting very far. If someone included a mouse in a painting with the specific intent of passing it off as by Cuneo, that is forgery, not breach of copyright.

Pre-AI examples

This situation isn’t unique. Long before AI was anything but science fiction, I saw an image posted on Flickr of a covered bridge somewhere in New England. The artist concerned had taken hundreds, perhaps thousands of photos of the same bridge, a well known landmark, all posted on Flickr, and digitally layered them together. He had not sought prior approval. The final image was a soft, misty concoction only just recognisable as a structure, let alone a bridge. The discussion was fierce, with multiple accusations of theft, threats of legal action etc.

In practice, though, what was the breach? No one could positively identify their original work. Even if an individual image was removed, it seems highly unlikely that there would be any discernible impact on the image. I would argue that the use of images from the internet to ‘train’ the AI is analogous to the artist’s use of the original photos of that bridge. In the absence of any identifiable and proven use of an image, there is no actionable breach.

Who has the rights to the image?

An additional complication, in the UK at least, stems from the fact that, unlike many countries, the law makes express provision for copyright protection of computer-generated works. Where a work is “generated by computer in circumstances such that there is no human author”, the author is “the person by whom the arrangements necessary for the creation of the work are undertaken”. Protection lasts for 50 years from the date the work is made.

It could be argued that in the case of AI art packages, the person making the necessary arrangements is the person writing the text prompt. As yet, that hasn’t been tested in a UK court.

See Also

A paper produced by the Tate Legal and Copyright Department. I can give no assurance it is still current.

Is AI use of training data, moral?

Broader issues of morality are also often raised. There are two aspects to this.

There are moral rights within copyright legislation. Article 6bis of the Berne Convention says:

Independent of the author’s economic rights, and even after the transfer of the said rights, the author shall have the right to claim authorship of the work and to object to any distortion, modification of, or other derogatory action in relation to the said work, which would be prejudicial to the author’s honor or reputation.

If the use of a specific work in an AI-generated image cannot be identified, or even proven to be there in the first place, it is difficult to believe that its use in that way is ‘prejudicial to the author’s honor or reputation’.

Broader morality

There is also a broader moral issue. Is it ethical to use someone else’s work, unacknowledged and without remuneration, to create something else? As with all moral arguments, that question has no definitive answer. This Instagram account is interesting in that respect.

There is a fine line between taking inspiration and copying. That line is not changed by the existence of AI. Copying of artistic works has a long tradition. As Andrew Marr says in this Observer article, “the history of painting is also the history of copying, both literally and by quotation, of appropriation, of modifying, working jointly and in teams, reworking and borrowing.”

The iconic portrait of Henry VIII is actually a copy. The original, by Hans Holbein, was destroyed by fire in 1698, but is still well known because of the many copies. It is probably one of the most famous portraits of any English or British monarch. Copying of other works has also been a long-standing method of teaching.

Is it acceptable to sell copies of other people’s work?

That of course begs the question of whether AI art is a copy. Setting that aside, it also takes us back to the issue of forgery, or the intent of the copyist. For many years, the Chinese village of Dafen is supposed to have been the source of 60% of all art sold worldwide. Now the artists working there are turning to making original work for the Chinese market. Their huge sales of copies over the decades suggest that buyers have no objection to buying copies. None of those sales pretended to be anything but.

Is giclée a scam?

Many artists sell copies of their own work via so-called ‘giclée’ (i.e. inkjet) reproductions. The marketing of these reproductions often seems to stray close to the line, with widespread use of empty, if impressively arty-sounding, phrases – ‘limited edition fine art print’ and the like. I’ve even seen a reputable gallery offering a monoprint in an edition of 50, with no explanation in the description of how this miraculous work of art was made. It was, of course, an inkjet reproduction. To be accurate, there was an explanation, but it was on a separate page, with no link from the sale page.

Ignoring the fact that these are not prints as an artist-printmaker would understand them, the language and marketing methods used are designed to obscure the fact that these are not, of themselves, works of art, but copies of works of art.

In that context, I believe an anonymous painted copy of a Van Gogh to be more honest about what it is than an inkjet reproduction of an oil painting by a modern minor artist. The copy at least has some creative input directly into it, whereas the reproduction is pretty much mechanical. I’ll return to this in Part 3.

Bias in AI art

The possibility of bias in AI in general is a real cause for concern. In the specific case of AI art, the problem may be less immediately obvious, but as AI art is used more widely, the representations it generates will become problematic if they are biased towards particular groups or cultures. One remedy would be to increase transparency about data sources. If the datasets used to train AI models are not representative or diverse enough, the AI output is likely to be biased or even unfair in the representations created.

Issues likely to affect the dataset include:

  • A lack of balance in the representation of gender, age, ethnicity and culture
  • A lack of awareness of historical bias, which can then be replicated in the AI
  • Labels attached to images during preprocessing or dataset creation that inaccurately describe the content, or are influenced by subjective judgments, and so perpetuate those biases in the model
  • Changes to the AI model after deployment that introduce bias if not properly managed and documented

Lack of transparency may lead to other problems:

  • AI systems often work as “black boxes”: they provide results without explaining how those results were obtained
  • Difficulty in meeting regulatory requirements such as on data sources
  • Poor documentation of the data sources and data handling procedures, preprocessing steps, and algorithms used in the AI.
  • Inability to demonstrate the existence of clear user consent mechanisms, and adherence to data protection regulations (e.g., GDPR)

These can all lead to poor accountability and lack of trust.

Does AI copy?

AI, as it currently stands, does not copy existing works. Nor does it collage together parts of multiple works. Somehow, and I do not pretend to understand the technical process, it manages to generate new images. They may become repetitive. They may, especially the pseudo-photographs, reveal their AI origin, but despite all that they somehow produce work which is not a direct copy – i.e. original.

For the future, much depends on the direction of development. Will these apps move towards autonomy, capable of autonomous generation of images on the basis of a written brief from a client? Or will they move towards becoming a tool for use by artists and designers, supporting and extending work in ‘traditional’ media? They are not mutually exclusive, so in the end the response from the market will be decisive.


Thinking about size

If you go to any gallery show, the chances are you will see some huge paintings. Indeed, some are so big they might equally be described as murals. How much of that comes from the artist wanting more space in which to work, I wonder? It’s a value judgement, of course, but too often with contemporary art it seems to me to be a substitution of size for content.

Of course, simple marks can be transformed by an increase in scale. Compare a single brush mark on a postcard-sized piece of paper with one drawn at full arm extension on a large canvas. In the first case, that mark is generally seen as a line. It marks off one area of the image from another without much in the way of detail. At the larger scale, though, it becomes an element in its own right: it has internal structure and texture. That doesn’t mean that the huge sweeping mark is better than the small line. It’s just different. Increasing scale doesn’t, of itself, add artistic value, but it does open up the possibilities.


I read a quote from Mark Rothko about this. He wanted people viewing his paintings to enter into them.

The reason for my painting large canvases is that I want to be intimate and human. To paint a small picture is to place yourself outside your experience, to look upon an experience as a stereopticon view or with a reducing glass. However you paint the larger picture, you are in it. It isn’t something you command.

Mark Rothko

He saw them as objects of contemplation, as an experience in itself rather than a record of an experience. Rothko’s paintings are indeed very large. Sitting close to them, as he wanted, it is easy to lose the world beyond the canvas. It fills your peripheral vision. After a while, the world beyond the picture fades away.

That experience is indeed a function of the scale. I tried making some versions of these works on 10” x 10” canvases. I don’t know how Rothko constructed his pictures. My approach was to build up layer after layer of colour in varying shades, wiping back, adding more colour and so on. What I got seemed to be a reasonable approximation of a Rothko surface. My relationship with it was, however, very different.


This seems to be distinctive to Rothko, for me anyway. I had nothing similar when, for example, sitting in front of one of Gerhard Richter’s Cage series. To be fair, that was in a crowded show, not a normal quiet gallery. I think there is just so much going on in the ‘Cage’ paintings that it’s easy to find oneself deconstructing them, working out how the layers lie over one another. That is an analytical rather than a contemplative relationship. One of these days I intend to sit in front of the ‘Cage’ series again and see what happens.

I missed the exhibitions at the RA in London of work by Richard Diebenkorn and of the Abstract Expressionists. It is unlikely I’ll get another opportunity to see their work and make the same comparison. Watching film of Pollock at work, however, it is possible to imagine achieving a similar contemplative state. His movements as he paints are like a ritual dance of some sort, with the marks and drips forming a record of it, like some form of dance notation.


Modern art is not the only place to find monster canvases, of course. David’s painting of “Napoleon Crossing the Alps” is about 2.6 m x 2.2 m. (There are five versions, all slightly different sizes.) Napoleon refused to sit for it, saying that the character that comes through was more important than the likeness. The resulting painting is meant to impress, to dominate the viewer. It is more about Bonaparte the Emperor than Bonaparte the man.

So-called ‘History Paintings’ are almost all large in scale. The term typically refers to any picture with a high-minded or heroic narrative, as illustrated by the exemplary deeds of its figures. The message must, however, be edifying and worthy of depiction.

Although perhaps not strictly speaking ‘history paintings’, the apocalyptic works of the Victorian painter John Martin also come to mind, as do the revelatory, epic landscapes painted in nineteenth-century America. In all these works, the role of the viewer is almost to pay homage.

I’ve identified three distinct modes of relationship between the viewer and the very large painting: contemplative, analytical and deferential. I’m sure there are other ways of thinking about this relationship. In particular, I’m not sure whether my ‘analytical’ response is driven by my being an artist myself.

I wonder, though. Would a non-artist react in the same way as I did to works by the likes of Richter, or would they just lose themselves in the colours and shapes?

What do you think?


What does it mean – part 3

Fourteen Discs - painting by Patrick Heron

This is the paradox that lies at the heart of the mystery of artistic creation. The meaning which a work of art has for society is not the same meaning that the artist was conscious of putting into it. This is because a work of art is not just a telephone exchange which facilitates straightforward communication. The work of art is in some profound sense an independent, live entity. It has its own life. It draws nourishment from its creator that he was totally unaware of having put into it: and it redistributes nourishment to the spectator (including the artist himself, for he is merely a spectator once the work is completed). What the work itself communicates is a transformation of all that the artist was conscious of investing in it. Its success or failure, as a transmitter of thought or emotion, simply cannot be planned and guaranteed beforehand. This is why I say that to demand a certain result from art in advance is utterly to misconceive the central creative process itself. It is to suppress spontaneity: to batten down on the subconscious.

Patrick Heron, Art is Autonomous
First published in The Twentieth Century, September 1955.
Reprinted in Painter as Critic, Patrick Heron: Selected Writings
Editor: Mel Gooding, Tate Gallery Publishing, 1998
ISBN 1-85437-258-0

More on Patrick Heron: