What is Ardour's Generative AI policy?

Software is a creative field in and of itself, and generating code is no better then audio or images. I respect @x42 and @paul not only for the quality of Ardour as it stands, but the continual effort to make it better and listen to user feedback. Those two, as well as contributors who’ve come and gone, all leave their mark on such a project in their decisions, ideas, how those ideas are implemented and optimized, everything. Programming is an art form, one I appreciate every day as I continue to suck at it!
Having only recently learned what Ardour mean, I think it lives up to its name. You can tell its made with love when it gives you a silly plugin to turn your keyboard into a damn telephone, and when you look to the description for a reason why, you get an ET reference. “Gen” “AI” might be able to licit confusion and much worse, but it’ll never be charming like that, let alone produce anything worthwhile.

1 Like

I was curious about YouTube, so i did a quick google search. It only took one question, the info is not even hidden, yet, so many people are not even aware of it. To fools this looks like world of opportunities, freedom, progress. Guess i should have known all along… It was so logical & obvious.
And now what? Go ahead, be an damn artist, be an “content” creator, create something unique, waste more of your life :slight_smile: .

So that appears to be an AI summary response, so while it may be correct, I would look for a better reference as well as to what their policy is if you are trying to determine this.

Then again what Youtube does or doesn’t do isn’t necessarily on topic for this honestly.

  Seablade

Indeed, and not just limited to medical, but that opens a different can of worms. In your example it could turn doctors into an accountability sink. A key difference is if you choose to use a AI as another tool in your set, or if it is forced upon you. The latter is indeed something where I would subscribe to your boycott of AI, but in general I don’t agree with a ban of generative AI per se.

-=-

Anyway back to audio: There was a nice presentation by Fares Schulz here at the Berlin Audio Dev meetup with nice overview:

In fact the first 8 slides are a nice history of Neural Music Production since the the late '50s: Advances in Neural Music Production

Some of the approaches are very creative themselves!

1 Like

I have lately bitten the bullet and familiarized myself with AI deeply.

AI is just a new clever way of grouping close together concepts and things that belong together. When AI is asked a question it will be split the question into coordinates that map to those concept groups in AI’s memory and this is how it is able to produce text about those things. The same goes with music and pictures and videos, it predicts notes, sounds, pixels, etc.

It is just mechanics and a very clever human invention of how to group knowledge in a new way that makes that knowledge much much more usable.

Previously I felt threatened by AI because it might take away the things I love: music and coding. After taking time to learn AI I feel that it is just a tool for humans to use and it lowers the threshold for entering a field. I can compose a song and use AI just as a group of talented studio musicians to realize my vision of it. I tried it and it works very well. It is still my composition and lyrics and only “played and sung” by AI. Discovered that process from this video.

If you remember when punk - music was born (if you are old enough :slight_smile: ). It lowered the threshold for common people to enter music, since it kind of gave you the permission to make songs even when you weren’t great at playing an instrument or singing. The same happened with rap, it was easier to enter music because one didn’t need to go through the long process of learning to play and understand music. Now AI is doing the same and lowers the threshold entering music.

AI is just a tool to help anyone achieve a goal without a long learning process. You can use it where your skills lack to fill in the gap and help you be more creative. For example my singing is poor but I can play a couple of instruments. So I’m planning to only use AI by giving it my vocal track to make it pitch perfect and record everything else myself.

3 Likes

While I am not thrilled at all about generative AI I also faced the fact it is here to stay and to be ignorant of it especially as a person in late middle age could definitely put you at a disadvantage in the future in too many ways to fathom. I don’t need AI to make music or graphic art but I am not very good at coding and as someone who has never been great with advanced mathematics I really don’t think I have the knack for it but I do know exactly what I’m looking for in a technical sense. AV Linux is full of script snippets to do various things so I decided to bring these ‘napkin sketch’ utilities to AI and (a) improve them and (b) learn in detail about what steps were involved in the improvements so I could at least read and understand the code better. To be honest I can’t really describe it as easy, even when I had a very defined idea of what I wanted the AI (Google Gemini 3) would often need 5 or more iterations to get to a fully operational running script, if things were not going well it would panic-add redundant features and over-engineer things and I often had to tell it to start again from an early iteration. Even when I told it specific details like icon names and locations it would forget them occasionally and it would undo some of my script edits even when asked to retain them in new iterations. Even though I was signed in for it to keep a ‘memory’ of our previous conversations and projects it would often need reminding and wouldn’t carry relevant ideas into newer scripts unless asked. I felt like I had to essentially re-introduce myself and what my goals were at every new session. This all sounds like it was inept and useless but this is not the case, it just needed a lot of detail and the onus to be organized and keep track of the fine details was definitely my job, as was the in-depth testing. In the end I got a really great set of utilities and although a lot of time and testing was required I ended up with something that I definitely couldn’t have done unassisted. I also learned a lot about Gemini’s current strengths and weaknesses which will make me more effective at prompting it in the future. To get unlimited help with a bash script or the expertise to fix a stubborn theme css file without asking or bothering a human is something I think I will find great value in in the future especially as a spare time Distro maintainer. Overall I’m very glad I set aside my prejudices (and fears) and saw firsthand what using AI is like.

2 Likes

Good point/parallel, but as a self-proclaimed punk rocker, this dosen’t sit right with me.
I listen to various derivates of punk since i was in 3rd grade of elementary school. I like things like The Ruts, Dead Kennedyes, Bad Brains, Ramones of course, The Buzzcocks, PTTB, The Blood, The Damned, Agent Orange and various variants forms of artsy or HC punk. You could say punk rock is my home (aldo i do listen to mostly anything, i don’t realy consider myself genre-centric).
The whole point of that movement was to do it yourself, and the rest was in the hands of faith. Not the machine, not the robot…you. Best to your ability, so it often becomes cute, commical, annoying or straight up awful :slight_smile: .
Look, i used AI already in various ways - used Suno Studio, used AI vo, AI video creation tools etc. I might be forced to use it even more in the future, and when commercial aspect is in question, i don’t mind/care, but i simply don’t see the point in using it in my own creations.
If it comes up with the melody or part of it, that’s not my melody anymore. If it rearanges my arangement, that’s not my song anymore. If it rocks on drums and make the song shine, that’s not realy my recording anymore. And the inevitable part - it stores all of that “colaboration” on a server, are reuse it when prompted by someone else. The whole point of creating something, especialy if you strive for uniqness, becomes questionable. It’s all becomes watered down, stripped of esence, meaningles, an “okay, whatever”.
And on top of that, it’s literaly steals from contemporary artists, and we should all accept that as normal. I’m not going to pretend i’m okay with something like that.

5 Likes

Your point about accountability is fair. I’m not here to argue for AI, obviously. The medical side of it isn’t something for me to figure out I just hope that whatever genuine use it does have, the “AI” they use isn’t the “AI” we have now, and is actually ethical.

I digress.

I don’t see how AI being forced or used at will makes a difference in this case. My points about gen AI being unethical still apply across the board, which I say not to be rude, I just don’t see how any of us should accept “gen” AI into Ardour in any form.
The whole concept has only gotten this far because people roll over, saying we must “get used to it” or be left behind. I apologize for the extreme example but… you know what they say about a Nazi bar? Software works similarly, and we best keep tech bro “AI” nonsense far away from open source, as it goes against the whole concept of FOSS in the first place. FOSS is about lifting each other up and trying to create things by people, for people, with the help of other people. “Generative” AI is the antithesis of that, and I don’t really want to ramble more because I’ll end up repeating myself.

If you are concerned about AI, please, please keep Ardour completely free of it, in code and functionality. It would give us much more faith in the project for this action to be taken, and honestly, it’d simply make a lot of people happy to see from their favorite DAW. <3

As someone who loves punk music and all it stands for, it should be obvious that “gen” AI spits in its face. If you think a community that welcomes inexperienced musicians to learn and grow, not mocking them for their lack of form or knowledge, is comparable to asking maliciously crafted software for audio built from or assisted by metric tons of stolen art, I don’t know what to say.

"If you remember when punk - music was born (if you are old enough :slight_smile: ). It lowered the threshold for common people to enter music, since it kind of gave you the permission to make songs even when you weren’t great at playing an instrument or singing. The same happened with rap, it was easier to enter music because one didn’t need to go through the long process of learning to play and understand music. Now AI is doing the same and lowers the threshold entering music.

AI is just a tool to help anyone achieve a goal without a long learning process."

Where is the joy in learning or creating anything? I’ve had a lot of trouble with the education system over the years, and I’m sure I’m making no headlines by saying it needs copious reform (and should be free to all), but if I had decided to farm it out to a plagiarism machines even part of the time, not only would all my “work” have been invalidated, but I would have learned nothing, and I would have wasted hours upon hours of my professor’s lives with slop I didn’t even create.

I’m trying to remain calm, but the idea that the learning process for music, visual art, code, whatever, is something to be skipped over rather then relished in is an insult to all creatives.

2 Likes

I’m so sorry you’ve been forced to use AI in the past, for commercial things or otherwise.

Rather the opposite. Free as in freedom software does not discriminate against any use. One of the fundamental rights is to allow anyone to study and modify it. Banning AI as tool would void that freedom.

Similarly someone may make a song with Ardour to promote message that I disagree with and they’re still welcome to do that.

All that being said, it’s unlikely that Ardour will have an LLM generated code anytime soon, and not just because of (C) reasons. There are also no plans to directly integrate any genAI, though interfaces for 3rd parties to use at their digression are welcome.

2 Likes

Robin, with all due respect, why single out one phrase of my message and not respond to anything else? Yes, the concept of FLOSS is that anyone can study or modify it. However, just as somebody might release a video on YouTube for free consumption or I might write a post on social media, such things being public does not make them okay to throw into a plagiarism machine that was literally invented to devalue human creations more then they already are. The person having their work or likeness regurgitated is right to be upset, even if they did everything for free.
No, you can’t control what others do with your work, but that’s not exclusive to FLOSS software, it goes for nearly everything. Regardless, if my tools were used to create something awful, I would call it so. I wouldn’t shrug it off because the F stands for freedom, in the same way you wouldn’t ignore someone hurling baseless insults around because “free speech” allows them to. You’re right, we should all be free to use Ardour and its code how we want, just as we can use our voices. That does not make it any better to abuse either, and doing so should net such a person rightful pushback.

Why are you dedicated to open source tech? Is it privacy? AI’s a nightmare for that. The ability to learn from others’ code? AI spreads only misinformation and (to be generous) calculated guesses at the truth. Is it the ability to leave big tech companies behind, using community driven software for your endeavors that you can actually trust? AI is big tech companies. Any output from a “generative” LLM is nothing more then stolen material run through a planet-killing black box which, considering you can’t see what its even trained on, makes it arguably worse then most pre-AI closed source software. I don’t see how these things align with anything FLOSS.

someone may make a song with Ardour to promote message that I disagree with and they’re still welcome to do that.

I effectively responded to this over on GitHub. If a song of mine got used in a fear-mongering propaganda ad or something, I’d be right to call it out, even if copyright was somehow against me. Likewise, if my software was scraped or used in some slop project, I’d make my stance on such things known, if nothing else. I certainly wouldn’t consider myself open to adding AI or even “interfaces for 3rd parties to use” because other forks will add them regardless. They are not Ardour, they are a modified version of Ardour controlled by other people. Like I said in that comment, its about what you can control, and Ardour is controlled by Paul and yourself. We use Ardour as opposed to a fork of it because we respect and trust you two.

As @x42 noted, we have zero plans to incorporate generative AI stuff into Ardour, and zero plans to include “purely LLM generated code” into Ardour. We are not seeking to expand our “AI capabilities” or “AI integration”, and we’re not seeking to make it easier for AI tools to use Ardour than it already is (which is actually pretty easy).

We’ve stated on the development page that we cannot accept LLM-generated code into Ardour because it cannot be copyright and thus cannot be licensed. There’s significant wiggle room here because no court has yet established how much work a human has to do in order to tranform it from a machine-generated work into a human one. We will watch that space with interest.

The MCP server was added, as @x42 explained above, to potentially facilitate better interactive use of Ardour by people with visual impairments of various kinds.

I’m not really sure what it is that you want us to do.

4 Likes

Oh, that’s easy…
We are not shure either :slight_smile: .

Let’s see…
For starters, do everything opposite of Suno, and simply go Paul full blast, i suspect that’ll do :slight_smile: .

If somebody comes and demands some generative AI feature, just release the tomahawk from you hand, it’ll find it’s target :slight_smile: .

2 Likes

That long post was mostly in response to the statement that AI isn’t antithetical to FOSS, an idea I also addressed in a reply to @x42 on GitHub that went ignored. Perhaps you preferred to respond to me here, Robin, which is fine, and maybe that reply was left before I was done with my own aforementioned one on GitHub.
From my perspective, however, it appeared that you saw my response over there, ignored it, then reiterated the point in this thread as if I hadn’t just attempted to engage with it in another. Additionally, the only acknowledgment of what I said on GitHub was directed at the pull’s creator, urging them not to be discouraged by my words. I didn’t intend, nor see how I was rude or disrespectful, so that stung a bit. Jumping back here to a comment similar to the one I just addressed, I felt the need to elaborate further since my last reply seemed to have been ignored.
Hence, long message. Hence, I likely came off as overly upset or spiteful. I was the former, but if none of what I described was intended, I’m genuinely sorry for all of that.

The MCP server was added, as @x42 explained above, to potentially facilitate better interactive use of Ardour by people with visual impairments of various kinds.

I will never downplay the importance of accessibility, but as someone with a disability that makes it more difficult to create visual art, it would not justify me stetting ethics aside to make a computer regurgitate a mishmash of other people’s creations. I’ll admit I’m not sure how best to approach accommodations for the visually impaired, but I doubt anything AI related has to be involved.

we cannot accept LLM-generated code into Ardour because it cannot be copyright and thus cannot be licensed. There’s significant wiggle room here because no court has yet established how much work a human has to do in order to tranform it from a machine-generated work into a human one. We will watch that space with interest.

While once again, I am grateful for your apprehensiveness towards “generative” AI, you’re highlighted the point I’ve been trying to make. If you’re concerned about the effects of this harmful tech being pushed by the kind of companies FOSS is meant to oppose, I fail to understand why copyright is the only concern. I wouldn’t put money on governments making half-decent decisions regarding “gen” AI if it existed decades ago, let alone now, given the state of the world. Does this mean Ardour will embrace such things if the law lets it? I’m not trying to put words in your mouth, this is why I recommended elementary OS’ policy as a good example all the way back in my first message. Maybe those details got lost in all my yapping. I’ve been told I haven’t shut up since day 1 so I guess old habits die hard. Apologies.
I’m not trying to attack or intimidate you, I don’t intend to be an annoyance, and I don’t want you to wince when my name appears in these forums. I only want to see the most valuable project I’ve used since going full FOSS, a piece of creative software, reject machines that exist only to further disrespect creatives and our planet. That’s all.

It’s not the only concern. It is, however, the simplest concern: no need to do any analysis of the pros and cons of any AI platform or process - we just can’t do it, because it violates our own license. If that issue didn’t exist, we would certainly have to have a more evolved position on the much more complex issues. But it does, and it acts as a convenient barrier.

If the law were to change to allow machine generated code to be copyrightable (and thus licensable), we would put more time into considered the broader issues. But right now, in the USA at least, that is not the case, and I’d rather work on bugs and features.

I doubt anything AI related has to be involved.

Well. maybe but maybe not. If you wanted to build an interactive front-end that allows a person to take control of Ardour from speech or text inputs, it would be a huge task if you chose to do it any other way than the LLM way. That doesn’t make the LLM way “correct”, but I wouldn’t be so confident that other approaches are sensibly realizable.

Regarding “directed at the pull’s creator” … the only people who set “Ardour’s Generative AI policy” are @x42 and myself, not PR creators or forum commenters, so while it may be valuable to persuade others of the deep problems with LLM technologies, doing so doesn’t have much to do with Ardour’s own policy.

I see, I understand your focus remaining on the software itself itself for now. If LLMs never existed, I’m sure we’d be able to find a way to create these accessibility functions otherwise. I know similar things have been done before actually, but regardless. Thank you for responding, I appreciate it.

Highly unlikely and not for a lack of trying. We failed to do that for 20+ years. There was a very early prototype ages ago, but that not viable to maintain with evolving features.

I also like the fact that LLMs can be used as studio-assistant. Your new intern can set up routing, and do basic session tasks…

Personally I am looking forward to seeing how well it can handle certain aspects of session analysis for my own purpose, again nothing to do with generating work at all, everything to do with analyzing existing work. Could I do it without an LLM? Sure, in fact I already do, but it takes hours I don’t have to spare to do so and I probably still don’t go as in depth as I could.

 Seablade

So can lua scripts :wink:

But you’re right, an LLM is way more convenient and accessible than the LUA API. Not everybody has the time and technical understanding to write a script that does the right thing in Ardour. Even LLMs fail at that task so far, I read a few times here.

Pretty sure that many fear that the AI will overstep in its intern role and starts either interupting or taking over the creative process. A script runs and shuts up afterwards, or is only triggered on clearly defined signals from the session.

In my opinion, the assistant would need some very hard and easily definable borders, like some checkboxes in the ardour settings what it can and can’t do, not some file in plain english asking it not to overstep