Learn about Gnash, Geoclue, Firefox, social desktop and more on the NLUUG autumn conference on Oct 29 in Ede (NL).

Create an account

Subscribe to LWN

Return to the Development page

Verio's Linux VPS now up to 50% off! Get root access, SLAs, dedicated CPU, bandwidth & memory. Click for details.

Ext3 and RAID: silent data killers?

LWN.net Weekly Edition for August 27, 2009

Pyjamas: writing AJAX applications in Python

SCO: not dead yet?

On properly packaging perl

Printable page

Weekly edition	Kernel	Security	Distributions	Search
Archives	Calendar	Subscribe	Write for LWN	LWN.net FAQ

LPC: Linux audio: it's a mess

September 18, 2008

This article was contributed by Don Marti

Audio is a fitting topic for the first day of the Linux Plumbers Conference. Users want sound to Just Work, and there's lots of working code in individual projects. But so far, it seems like nobody has everything quite plumbed together in an annoyance-free way.

Lennart Poettering, a lead developer of PulseAudio and Red Hat employee, moderated the miniconference and started with a summary of the state of Linux audio: "it's a mess." The audio miniconference came up with two steps toward cleaning up the mess, though. First, come up with a coherent story for application developers on what sound API to use, and how. Second, clean up the often-confusing array of user-visible audio level controls.

PulseAudio first appeared to regular users in Fedora, starting with version 8, and now, as Lennart puts it, is for up-to-date users, "the software that currently breaks your audio." PulseAudio is a sound server that mixes audio from multiple applications and passes it along to the sound hardware. It offers advanced features such as network transparency: an application can play a sound on a remote system, and PulseAudio makes it come out the speakers on the remote machine where the user is working. Supporting it shouldn't be a big change for most application developers to handle. It will handle applications written to the kernel's maintained audio API, ALSA, using the PulseAudio backend for alsa-lib. So the PulseAudio transition has been relatively painless for the distributions.

An earlier sound server project, the Enlightened Sound Daemon (ESD) sound server, is falling out of favor and Media Application Server (MAS) has never really caught on. However, one of the competing sound servers looks likely to remain. On the pro audio side, the low-latency sound server JACK is the recommended option. JACK, the "Jack Audio Connection Kit," as Dave Phillips writes, "holds the keys to the kingdom" for connecting studio applications such as the Ardour digital audio workstation and the Rosegarden MIDI sequencer. "If you want all of the features, no one audio system supports all of them," Lennart said.

Apple and Microsoft each have a single sound server that does both desktop and pro audio, but nobody at the session seemed to have much interest in that direction for Linux. PulseAudio is optimized for general desktop use and power savings, and supports scheduling features that should minimize wakeups but still allow for reasonably low-latency playback of streaming audio. It's also network-transparent and supports features such as placing desktop sound events based on mouse position. Network audio and desktop effects don't tempt pro audio users. JACK's uncompromising approach toward latency means it's likely to hog too much power to be acceptable to battery-life-watching desktop users, but fine for a studio with a rack full of gear. So two sound servers, one for pro and one for the masses, seems to be fine with both sets of users.

Abusing ALSA

PulseAudio, however, can't give applications direct access to the hardware, and currently only about 70% of ALSA applications use the API in a PulseAudio-safe way, Lennart said. Some high-profile applications are among those doing audio wrong. "Flash and Skype are really really broken applications, especially Flash," he said. Adobe split out the parts of its code that talk to the audio subsystem, and certain other plumbing, into an open-source library, libflashsupport. But Flash remains broken. The proprietary Flash library talks to libflashsupport from multiple threads, and one thread calls a destructor while another continues to send data. "It works until you close the browser window and then you get a race," Lennart said.

Developers who want to play audio have a sometimes-confusing choice of tools, including PortAudio and GStreamer. (PortAudio is cross-platform, which is likely why the popular cross-platform audio editing application Audacity uses it.) GStreamer is relatively feature-intense and heavyweight, also handling video and transcoding. (Write a player with Gstreamer and you get the ability to play your collection of C64 SID files for free.)

[PULL QUOTE: If someone comes and says, 'I want to write an audio application. Which API should I use?' I don't have a good answer END QUOTE]

"If someone comes and says, 'I want to write an audio application. Which API should I use?' I don't have a good answer," Lennart said. The current best answer seems to be to write to the PulseAudio-safe subset of ALSA. Jeff Licquia of the Linux Standard Base (LSB), in the audience, mentioned that ALSA is on track for inclusion in LSB 4.0, and is a trial use module for 3.2. LSB aims to define a compatibility standard for Linux applications, and aims to do the kind of application developer education that Linux audio developers seem to need. Applications seeking LSB certification must run all of the LSB tests, but can fail anything tagged as trial use. "We're only keeping the stuff that we hope will be around for the long term," he said. If the LSB-safe subset of ALSA fits into the PulseAudio-safe subset of ALSA, application developers could write to ALSA and test with LSB.

"I would like to be able to tell people to use libsydney," Lennart said. Libsydney, in progress, is intended to be a networking-friendly general-purpose audio API.

ALSA and the HD-Audio widget problem

In ALSA, the hardware/software interface is in good shape, but software to user interface needs some work. Takashi Iwai, a core ALSA developer and Novell employee, pointed out in a talk that the line count for /sound code in the kernel is actually shrinking, except for ASoC (system on a chip) and HD-audio. "There will be no more sound cards, especially PCI," he said. The one exception is the SoundBlaster X-FI for gamers, which is currently not supported well in ALSA. Creative announced proprietary drivers in 2006, but one ALSA developer recently did get access to a data sheet under NDA.

The new audio standard, HD-Audio, is commonly found on new systems, and it's well-supported at the kernel level. However, it's based on "widgets" with vendor-configurable I/O pins. A driver can't tell how the HD-Audio part is connected, so some Linux plumbing work is required to identify which of the many exposed level controls is the right one to show the user. An audience member pointed out the need to tweak multiple level settings on his hardware, to get the right level without distortion. Linux will need more information on how each machine has its HD-Audio hardware hooked up in order to reliably give the user a useful volume control.

(Log in to post comments)

LPC: Linux audio: it's a mess

Posted Sep 18, 2008 23:36 UTC (Thu) by Richard_J_Neill (guest, #23093) [Link]

Why have so many distros forced PulseAudio on the end user? PA isn't a neat feature, it's a CPU-hogging(*), irrelevant bug - and it has been deployed before it is even properly working! We have Alsa with dmix now, and that should be enough to totally obsolete the concept of a "sound server".

With PA, my 1GHz Mini-itx box goes from being about able to play DVDs, to unable to work without chronic skipping. PA wastes 30% of the CPU, to achieve nothing!

I can see how network-transparent sound can sometimes be useful, as can the ability to multiplex soundcards together. But for the common case (of allowing multiple apps to be mixed in software so that all can share the soundcard, without blocking each other), plain ALSA/dmix is sufficient.

LPC: Linux audio: it's a mess

Posted Sep 18, 2008 23:57 UTC (Thu) by nix (subscriber, #2304) [Link]

The 'PA chews CPU when doing nothing' bug (actually a feature implemented
in a CPU-chewy way) was fixed in PulseAudio 0.9.7, released nearly a year
ago. Upgrade!

PulseAudio's degree of CPU-hogginess is customizable: see
`resample-method' in daemon.conf. This is prominently marked in the
documentation.

alsa with dmix obsoletes the concept of a sound server iff you have only
one machine or never ever want to produce sound on any other machine, and
have only one set of speakers, and don't care about per-application volume
restoration or compatibility with things like esd. Some of us care about
that.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 9:37 UTC (Fri) by dgm (subscriber, #49227) [Link]

> alsa with dmix obsoletes the concept of a sound server iff you have only
> one machine or never ever want to produce sound on any other machine, and
> have only one set of speakers, and don't care about per-application volume
> restoration or compatibility with things like esd.

I think you just described 99.9% of desktop Linux users.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 11:07 UTC (Fri) by nix (subscriber, #2304) [Link]

What? Most Linux users don't care about e.g. muting apps that make annoying sounds while music is playing, or pushing up the volume of important alerts so they're audible over whatever else is going on? Just because they can't *do* it doesn't mean they don't *care*.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 21:49 UTC (Mon) by lysse (guest, #3190) [Link]

I don't care. Do I get to be a data point?

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 23:21 UTC (Mon) by nix (subscriber, #2304) [Link]

I'll add you to my extensive and elaborate statistical baseline (from
which data is extracted using gut feelings and rand()).

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 16:48 UTC (Fri) by iabervon (subscriber, #722) [Link]

I think users almost all want independent volume controls for different audio-producing applications; but I think this is something that users, in general, think is just something computers can't do. (In particular, they want to turn down the volume on applications that just "make sounds" and therefore don't bother to include a control and on applications that want to draw the user's attention unwillingly and turn their own volume up)

I also think most users would like to play their alert sounds out the laptop speakers despite having nice speakers plugged in, but assume that's impossible (and I think it is for a lot of hardware; there's only one DAC that gets switched).

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 8:05 UTC (Mon) by ceplm (guest, #41334) [Link]

I thought so too, but now I am switching between USB headphones and loudspeakers all the time. Thanks PA!

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 12:57 UTC (Fri) by zooko (subscriber, #2589) [Link]

Sounds good to me! Where do I sign up? Oh, it already works. :-)

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 17:38 UTC (Fri) by Richard_J_Neill (guest, #23093) [Link]

> The 'PA chews CPU when doing nothing' bug (actually a feature implemented
> in a CPU-chewy way) was fixed in PulseAudio 0.9.7, released nearly a year
> ago.

When I said "Doing nothing", I do mean that I was playing a DVD at the time. PA was doing "nothing", in the sense of "adding no value", not in the sense of "being idle". I suspect you are right that killing off resampling would have solved the problem - but I didn't know that, and the default settings were wrong. It seems fundamentally daft to have a daemon handling sound at all.

> alsa with dmix obsoletes the concept of a sound server iff you have only
> one machine or never ever want to produce sound on any other machine, and
> have only one set of speakers, and don't care about per-application
> volume restoration or compatibility with things like esd. Some of us
> care about that.

Actually, that's what drove me mad about PA. I already have a nice setup, with most sounds routed to the default onboard soundcard (/dev/dsp), which connects to the monitor speakers. One or two applications are permitted to use /dev/dsp1, which connects via a USB soundcard to a hi-fi. PA makes it extremely hard to choose specific devices at the application level. For example, I play the BBC's listen-again real-audio streams by using this shell-script:

CACHE=500
DEVICE=/dev/dsp1
LOCATION=$(wget -O- $1 | tr -d '"') #An rtsp:// URL
mplayer -cache $CACHE -ao oss:$DEVICE "$LOCATION"

I'm sure there is a configuration GUI for this, but a few months ago, it just wasn't ready. PA may be a fine program, but it shouldn't be used by default till it has had some more testing, and polish.

BTW, if you want to play sound on a remote machine, ssh in, and use /usr/bin/play.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 22:07 UTC (Fri) by nix (subscriber, #2304) [Link]

It seems fundamentally daft to have a daemon handling sound at all.

Then what do you suggest does mixing of stuff from multiple processes? You can't do it in the kernel, it's solid floating point at least (and more likely some flavour of SSE). You have to do it in some separate process.

btw, dmix forks off a daemon.

(Regarding the rest, maybe Lennart or someone can chip in: I use PA mostly for the network-transparency and volume-choice stuff and have only one sound card and set of speakers.)

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 21:05 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

Please note that ALSA dmix doesn't actually fork a daemon off anymore. Since quite a while actually.

ALSA dmix is a pretty nifty piece of engineering. It allows multiple programs to write data to the audio device hardware buffer simultaneously in a very elegant way. However, this design nonetheless limits the possibilities of it quite a bit. For example, doing stuff such as "glitch-free" directly in dmix would be very difficult.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 21:02 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

In PulseAudio you can switch any playback stream on-the-fly to a differnt device. No command line interaction necessary.

Also, each sink/source in PA has a unique name. You can use it as device string from the command line if you wish.

Sure, adopting PA is a bit of a departure for a couple of old-style Unix philosophies (such as "everything is a file" -- which is a pretty stupid philosophy anyway). But complaining about that is not really a technical argument to me. If we want to have a good, modern sound system for Linux than leaving some Unix legacy behind is the right thing to do.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:03 UTC (Mon) by nix (subscriber, #2304) [Link]

'Everything is a file' is a really really really really killingly useful
philosophy: or, rather, 'everything you can usefully manipulate can
produce an fd, and everything with a name is a file'. To see the problems
when this is forgotten, consider the horror which is SysVIPC.

I'm not sure how to apply this to PA though. Maybe a FUSE module reifying
the sources and sinks...

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:17 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

"Everything is a file" is crack. It's an attempt of an abstraction that just doesn't work. A file has completely different semantics than sound device. Trying to cramp audio stuff into file-like semantics results in APIs such as OSS.

fds as a handle for resources are useful. But cramping everything into read()/write() style interfacing and access control via Unix permissions is just too simple, and incompatible with all the modern things we want to do with our desktop experience now.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 7:05 UTC (Mon) by nix (subscriber, #2304) [Link]

I don't seen any incompatibility. The thing behind the file need not be a
raw sound device: it can be another abstraction. OSS's interface sucks,
but that was in large part because it relied on things like ioctl() and
mmap() that just can't be usefully virtualized (as you know much better
than me).

Just select() alone makes fds-for-everything a useful philosophy. I have
no *clue* why you imagine that 'modern things' require splintering a
single consistent interfaces into a mass of incompatible and
non-interoperable interfaces.

(FWIW, everything-is-a-file, the singly-rooted filesystem hierarchy, and
pipes were *the* key abstractions that Unix was designed around. Discard
them and you just don't have Unix anymore.)

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 10:55 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

So, you want to have an API that follows everything-is-a-file and then not use ioctl()s for configuration of the sound device? How else would you would like to see that happen?

ALSA internally doesn't even use read()/write() anymore. It uses ioctl()s everywhere as a generic function call multiplexer. That we have to do this is due to the everything-is-a-file philosophy and is basically just a hack: you lose type-safety and everything.

Unix access control is too simple: we want that access follows the user that is logged in on the local VT: when he logs out he needs to be removed from the access. Unix doesn't allow that, because if someone gained access he can keep things open as long as hew wants.

I know that e-i-a-f is one of the core fundamentals of Unix. But you know what? It's also one of the core weaknesses of Unix.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 20:46 UTC (Mon) by nix (subscriber, #2304) [Link]

ioctl() is awful, everyone agrees. It's not virtualizable, it's not 32/64
compatible, it's not typesafe (as you said), it's thoroughly opaque...

The way to structure this is to put your controls in a directory, or a set
of directories, one file per control, rather than using ioctl() on a
single file with different requests. The way you solve the 'remove this
user' is to create these directories on the fly, and zap their contents
when the user logs out: even if the user retains a handle to the
directory, it's empty now so they can do nothing with it. (You might need
a simple non-device revoke() to do this properly, such that all I/O to an
ex-session fd returns -EIO, but this has already been written: it just
hasn't been applied because it's difficult to make it work with things
like PTYs. For this application, we don't care about that at all). Plan 9
did all this two *decades* ago and got it right: why can't Linux do it
now?

It's not even hard to make a system that works like this in userspace,
thanks to FUSE.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 20:57 UTC (Mon) by nix (subscriber, #2304) [Link]

Note that I'm not saying OH NOEZ PA SUCKS COS EVERYTHING IS NOT A FILE or
even 'oh no you must rewrite PA so that everything is a file'.

I'm just wishing for an ideal world in which the Unix philosophy was still
as universally followed as it was in its early days, and musing about ways
to make that happen. I think a working revoke() is the biggest missing
piece (and libraries to make exposing one's configuration state as a FUSE
filesystem painless).

LPC: Linux audio: it's a mess

Posted Sep 23, 2008 9:09 UTC (Tue) by fergal (subscriber, #602) [Link]

Seems to me that everything is a file doesn't have to mean that everything is a single file. You don't have to represent your whole soundcard via a single file. Think of /proc or /sys.

The problem is it is quite difficult for a program to present itself as a coherent set of files. While it can easily enough produce a file-like interface, it cannot easily produce a directory-like interface.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 0:25 UTC (Fri) by jwb (guest, #15467) [Link]

Er, absolutely not. PulseAudio allows me to mute my web browser while still listening to my music. dmix does not solve that problem.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 10:43 UTC (Fri) by djpig (subscriber, #18768) [Link]

Which is a severe misfeature in web browsers (and/or the respective multimedia plugins) actually, and which really can and should be fixed independently from the underlying sound system.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 11:38 UTC (Fri) by ewan (subscriber, #5533) [Link]

Hardly. It's (almost) always better to fix a problem once, centrally,
than have everyone re-invent their own particular wheel. Not least
because it gives the user a single uniform interface for controlling
things, rather than having to hunt through each individual app.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 21:22 UTC (Fri) by rahvin (subscriber, #16953) [Link]

I understand your point, but as a user if I want to control a feature of an application I expect that control to be in the application, not some central sound daemon. I would think the default user behavior when looking to control the sound of an application would look first in that application not at some upper level. I agree with the poster you are responding to, the problem with browser sound is with the plugin/browser, not some central sound control. I can't tell you how often I right click on some flash application thinking there will be a volume control in the context menu. There should be a control in that context menu, that there isn't is a problem with the application.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 3:41 UTC (Sat) by drag (subscriber, #31333) [Link]

Well it's confusing. I'll illustrate:

Say your using something like xmms or other application that has it's own volume control.

So you have the Gnome volume control stuff. It's the icon in your task bar and the mixer control stuff in your applications menu. This provides a 'sanatized' way to interact with volume controls. So you can have that nice little icon and the sound mixer buttons on your multimedia keyboard and whatnot.

Then you have the low-level Alsamixer interface. This reflects the actual hardware capabilities of your sound card. When your mucking around with the PCM slider your interacting with the sound card's hardware. It's confusing to use and each sound card has different capabilities so it's a UI that will change depending on what sound card you have.

Then you have your 3rd mixer interface at the application level.

Then on top of that people using desktops will have another volume control on their speakers. So that makes 4 different volume controls that interact with the sound card. Different applications will go through the Gnome stuff, interact with alsamixer directly, or have their own controls. Depending on how the application is configured it can interact with controls differently at different times. (ie, some apps can be configured to use OSS vs Alsa vs ESD vs artsd vs etc)

This is confusing.

Now say your trying to do a voice recording or interact with a VoIP application. Things get _very_ confusing.

With PA you have the master controls and the application controls in the same spot. Then that eliminates the requirements for mixing controls in your browser, file manager, your terminal's beeping, etc etc.

So this _should_ lower the mixing controls down to 2 on the software side. Alsa stuff should setup correctly, but it won't be for the significant number of people. So you still need alsamixer for getting the "baseline" set. But after that you should only need to deal with one interface.

It's still very goofy, but not as goofy.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 2:11 UTC (Mon) by elanthis (subscriber, #6227) [Link]

Less controls is not the right solution.

Sometimes you want the same thing in multiple places to help usability.
Where you might think to look first isn't where someone else might look
first. It's pretty clear that some people are going to try to adjust the
sound volume on the thing making the sound (the individual app) and not the
mixer control on the panel.

However, PA is what makes that possible. Note that I said "the same thing"
in multiple places. The volume widget in the app should modify the volume
of the PA stream, and be reflected in the desktop mixer utility, and vice
versa. Without things like PA, this becomes much harder. It's not even
possible without a lot of crap being shoved into the drivers.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 16:55 UTC (Mon) by drag (subscriber, #31333) [Link]

> Sometimes you want the same thing in multiple places to help usability.

Not when those multiple things do very different and conflicting actions in very very non-obvious manners.

If you have:
1. Mozilla Browser volume control and mute.
2. Gnome Volume Control and mute (that ties into your volume icon and multimedia buttons)
4. Low-level Alsa volume control and mute.
5. Speaker volume control and power.

Now you open up a youtube video. You get no audio.

Tell me, with certainty, which control, or combinations of controls, you will have to use to make the sound audible.

There is a difference between having 3-5 different places to control volume vs having 3-5 different controls you have to use.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 11:27 UTC (Sat) by dirtyepic (subscriber, #30178) [Link]

Well, when I want to control the sounds coming out of my laptop, I would generally head to the place where the sound controls are, not in every application on the system that has any kind of capability of producing noise. Do you really think having dozens of apps, each with their own independent volume control, is a good idea? It's stuff like that, everyone going off and implementing their own incompatible audio systems, that helped get us into this mess in the first place.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:10 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

You're wrong.

One of the biggest problems in volume control we have right now is that we have too many volume controls in line. The app might have one, the device has a couple, the hardware might have even more. It is very confusing to the user to find his way through this jungle and figure out which one is the one that is responsible for the low volume of the output.

The fix for this is to coalesce all those volume controls at a single place which should naturally be the sound server. If applications then want to expose a volume control slider they should expose the one maintained by the sound server -- instead that everyone implements its own.

With the "flat volumes" feature of upcoming PA 0.9.13 we even will be able to coalesce per-stream and per-device volume into one.

In summary: requesting that each app should implement its own volume controls goes in the wrong direction. Bad idea. Right in contrary exporting vol control to the sound server is the right answer here.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:23 UTC (Sun) by PaulWay (subscriber, #45600) [Link]

I'd also add that it's trivially easy to posit an application which doesn't provide a volume control. Or an application which provides volume control, but hidden deep within labyrinthine controls - like MythTV or many games. Or an application which is normally quite acceptably loud most of the time but some times is too soft or loud - e.g. mplayer playing some video files, or in a situation where you want it playing in the background.

The real point here is that having every application (hopefully) implement a volume control (and hopefully do it well), and having it in a different location for each application, is a usability nightmare. Having one set of centralised controls smooths over a multitude of minor sins.

Have fun,

Paul

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 10:07 UTC (Mon) by hppnq (subscriber, #14462) [Link]

It is very confusing to the user to find his way through this jungle and figure out which one is the one that is responsible for the low volume of the output.

Considering that this argument is more or less the same as the one in favour of per-app volume controls, one would be inclined to think of a protocol as a solution for this kind of confusion. But it seems like this has already been patented. Sigh.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 3:32 UTC (Fri) by elanthis (subscriber, #6227) [Link]

Remove PulseAudio if you want. Don't ask distros to remove it, though. I don't know about you, but I prefer having modern conveniences like USB hot-plugged audio hardware and Bluetooth headsets just working without having to restart my entire desktop to get applications to reconnect to hot-plugged ALSA hardware devices.

I like having software sound mixing that actually works. dmix never worked right for me. Not once. The few times it played sound, it played like crap. PulseAudio just works (except for Flash). Sounds great, no skipping, just works.

You have bugs you run into, I don't. Your solution is to throw away features a lot of people need to go back to the stone age where the shitty poorly-coded drivers and apps managed to narrowly avoid each others' bugs and happen to work. Intelligent people's solution is to go "OMG bugs! Let's fix them!" And then we all have perfectly working audio all the time by default.

File bugs. Or turn PA off. But don't scream about how PulseAudio is useless when it solves problems that ALSA never has and never will be able to solve, like hot-plugged devices.

(Personally, I still think ALSA was a mistake. These days all it amounts to is Linux being incompatible with every other UN*X OS's sound interface, practically _forcing_ developers to pick a sound server or wrapper library or some such just to have their damn app work on more than one distro.)

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 4:16 UTC (Fri) by drag (subscriber, #31333) [Link]

> (Personally, I still think ALSA was a mistake. These days all it amounts to is Linux being incompatible with every other UN*X OS's sound interface, practically _forcing_ developers to pick a sound server or wrapper library or some such just to have their damn app work on more than one distro.)

It's too bad that OSS didn't open source itself much sooner. It had lots of improvements that no distro could ship and the kernel couldn't support.

Considering the lack of resources devoted towards Linux sound driver development I think that the Alsa folks have done a good job.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 18:51 UTC (Fri) by dmarti (subscriber, #11625) [Link]

sitting at Plumbers Conference right now listening to Lennart Poettering summarizing the audio track. Problem is: "OSS is not virtualizable." Write to the OSS API and you can't feed it to a sound server -- it has to be on the hardware.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 13:12 UTC (Sat) by cortana (subscriber, #24596) [Link]

Isn't that the case for ALSA too if you directly fool about with the nodes in /dev/snd?

The virtualisability of ALSA comes from the alsa-lib library that all applications use. But that could, presumably, fire off data destined for the sound card to /dev/dsp instead of /dev/snd/pcmC0D0p...

Yes

Posted Sep 21, 2008 3:43 UTC (Sun) by dmarti (subscriber, #11625) [Link]

Yes, as Lennart pointed out, only 70% of ALSA apps use it in a way that will work with a sound server -- the other 30% only work if they can get access to the hardware.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:39 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

OSS is practically impossible to virtualize. There are hacks like LD_PRELOAD and stuff like CUSE. But they will never be able to fully provide virtualization. Stuff like mmap audio access (which is used by far too many applications unfortunately) cannot be emulated completely. Also, the timing model in OSS is too simple to really fit on a sound server backend.

OSS as a software is dead on Linux. I really hope OSS as an API will die, too. (as soon as libsydney is more than just vaporware)

ALSA can be virtualized much better than OSS, since it has better semantics and is a shared library which supports plugins and stuff. But in the end it's an API desgined for hardware devices, and the devil lies in the details: 100% ALSA compat in PA will stay a dream.

Lennart

LSB

Posted Sep 21, 2008 23:29 UTC (Sun) by dmarti (subscriber, #11625) [Link]

But it sounds like ALSA in some form is on track to make it into LSB -- and once it's there, apps and distributions will depend on it and continue to support it for a long time. Possible to make sure that the subset of ALSA that's in LSB is PA-safe?

LSB

Posted Sep 22, 2008 0:21 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

Yes, of course. Applications that use only a "safe" subset of the ALSA API should work fine with PA. That's what I'd call 70% compat.

Lennart

ALSA safe subset

Posted Sep 23, 2008 8:57 UTC (Tue) by sdalley (subscriber, #18550) [Link]

Is this safe subset of ALSA documented as such anywhere? If so, we need as many people as possible to BANG A REALLY BIG DRUM about it/write a beautiful Programmers Guide about it/write love songs in it, so that Linux sound applications everywhere can start getting things right now and not break/have to be ported again when libsydney comes along.

ALSA safe subset

Posted Jun 29, 2009 16:23 UTC (Mon) by pharm (guest, #22305) [Link]

Just in case this pops up in Google searches:

ALSA safe subset documentation, amongst other things:

http://0pointer.de/blog/projects/guide-to-sound-apis

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:42 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

OSS's general design (i.e. having device files as API for applications and stuff) is a big failure. I am pretty happy we are not stuck with OSS.

ALSA is great. It's API is complex, has issues, but still far better than OS. *far* better.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 9:51 UTC (Fri) by NAR (subscriber, #1313) [Link]

I prefer having modern conveniences like USB hot-plugged audio hardware and Bluetooth headsets just working [...] PulseAudio just works (except for Flash).

How many people want to use USB hot-plugged audio hardware and bluetooth headsets? How many want to watch (and listen to!) Flash? I think there are a couple of order of magnitude difference between the two. If a project goes for the first class of users instead of the second, then it doesn't have place in the default installation of a general purpose distribution. Actually Ubuntu Hardy breaking Youtube with PulseAudio was the last bit of motivation for me to move back to Windows on my home desktop.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 12:35 UTC (Fri) by shalem (subscriber, #4062) [Link]

Really people,

You should refrain from making comments about something you've clearly not given a serious try. Yes flash does not work as is with pulseaudio because its a *closed source* app which is abusing the alsa interface in many horrible ways.

Not working as is does not mean that it does not work though, Lennart has gone to many troubles to make it work. So all you need todo (on Fedora, other distros will have something similar) is: "yum install libflashsupport" and after that flash will work fine with pulse. Since you are manually installing flash anyways, having to install an additional package which helps integrate flash better into the system (it does more then just get pulse to work with flash) is really not all that much to ask.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 13:04 UTC (Fri) by NAR (subscriber, #1313) [Link]

Actually I did install libflashsupport, but no luck.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 14:12 UTC (Fri) by drag (subscriber, #31333) [Link]

Don't blame Pulseaudio for Ubuntu's goof with it. I know people still had some issues, but with Fedora it really did work very much better then Ubuntu.

The PA in Hardy was, as far as I can tell, a victim of bad QA.

And Adobe Flash does crash out my browser quite often. Most of the time when I close out a window which I just watched a video, which follows that race condition described in the above article. This hasn't anything to do with PA, since you know, I am not actually running it.

I am sure you have other good reasons, but switching to Windows to fix Youtube is a bit drastic when it could of been solved by turning off PA or/and installing some other flash player. :)

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 14:24 UTC (Fri) by NAR (subscriber, #1313) [Link]

Well, the youtube problem was just one too many. It used to work with Gutsy, didn't work with Hardy. Turning off pulseaudio didn't solved the problem and the audio wasn't in sync with other flash players.

Actually the browser window problem didn't affect me, because I use tabbed browsing, just one window which I don't close.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 14:47 UTC (Fri) by drag (subscriber, #31333) [Link]

I understand what your saying. It's too bad because more and more common to see experienced Linux users switching to Mac or Windows-based systems because they've just plain gotten tired of hacking around the same problems over and over and over and over again.

The Linux desktop experience is much more buggy and crash happy then Windows nowadays. People need to learn that having the source code is not a acceptable substitution for binary compatibility and good quality control mechanisms. It hurts open source software just as much as closed source stuff.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 18:06 UTC (Fri) by einstein (subscriber, #2052) [Link]

> The Linux desktop experience is much more buggy and crash happy then Windows nowadays.

LOL, I don't think so - although, who knows - I haven't run windows for some time, does it not crash anymore? Wow, that would be one for the books.

The linux desktop with pulseaudio has been working well for me (suse 11.0 at home, ubuntu hardy heron at work) but then again, perhaps my sound hardware is blase? (intel built-in) I do the usual, some web browsing, some gaming, some movies, some music. Other than a wrapper script that I needed for my 9 year old copy of quake 3 arena, everything just seems to work, out of the box.

Alternative for Adobe Flash users

Posted Sep 19, 2008 18:40 UTC (Fri) by dmarti (subscriber, #11625) [Link]

Don't know about anyone else but I have replaced Adobe Flash on my main laptop with Gnash. When I started running it this spring, it played YouTube usually, other Flash video sites almost never. Now YouTube always works and other Flash video sites work sometimes. Not ready for Grandma yet, but I'm not a big net video watcher and don't play Flash games much, so close enough for me.

Alternative for Adobe Flash users

Posted Sep 20, 2008 10:08 UTC (Sat) by DonDiego (subscriber, #24141) [Link]

What about Gnash's speed? Adobe Flash is already incredibly and annoyingly slow, will I have to invest in new hardware to run it? Yes, I still happily use a 500MHz PC...

Alternative for Adobe Flash users

Posted Sep 20, 2008 19:04 UTC (Sat) by jlokier (subscriber, #52227) [Link]

I ran Gnash on my Ubuntu Handy install for a few months - and it worked ok, and played Youtube videos for a while. But after a while it stopped playing Youtube and all other videos - it would just sit there with a spinner.

So I had to remove Gnash and replace it with the original Adobe Flash plugin.

(Unfortunately although there's a straightforward UI from Firefox asking which Flash plugin you'd like to install (I'd picked Gnash), there seemed to be no UI for removing or changing it.)

Disable Flash in the Browser

Posted Sep 25, 2008 13:12 UTC (Thu) by alex (subscriber, #1355) [Link]

Install both and then disable one with Tools/Add-ons/Plugins in Firefox 3?

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 22:11 UTC (Fri) by nix (subscriber, #2304) [Link]

Windows never crash? Well, it doesn't bluescreen much anymore, but it's
still deadlock-prone, especially if it loses network connections.

e.g. the backup domain controller went down at work today (our desktops
are Windows XP *sigh*). Within seconds *everything* on *everyone's*
desktop had frozen. Even my VNC and X sessions were stalled. Most of the
mouse pointers had frozen (but not all), and ctrl-alt-del did nothing.
Even the primary domain controller froze. It all stayed stalled until both
the primary and backup domain controllers were simultaneously rebooted.

That's robustness for you. (If everything's frozen solid, it can't crash.)

The time was when heavily-interlinked NFS systems could do that in the
Unix world, but I haven't seen anything like it for many years, and even
at its worst Unix gave you more tools to diagnose it than roomfuls of
simultaneously-frozen boxes. In some ways Windows is going *backwards*,
even before you look at the Vista trainwreck.

It's not the only mess

Posted Sep 21, 2008 10:07 UTC (Sun) by man_ls (subscriber, #15091) [Link]

But still it is true that the Windows desktop has advanced a lot in that respect: my corporate w2k desktop has not frozen once in a year. Mac OS X has similarly become a lot more robust in the intervening years since I stopped using it. Meanwhile Linux (and its many variants) regularly breaks hardware compatibility, shows multiple regressions and makes you play with your kernel every once in a while.

Maybe I'm biased because I use only debian testing, and maybe Mandriva or Suse are better. And of course there are areas where it excels (such as performance and support for older hardware). But I'm not sure there aren't at least some engineering reasons for the lack of quality. Windows or Mac OS X have one wireless API well thought from the beginning, and most gadgets work fine with it (while Linux still struggles with common wireless chips). More to the point, they have not had two major rewrites of their sound systems in a few years (ALSA and PulseAudio).

Things should just work, and once they are working they should just keep working, without excuses.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 20:29 UTC (Sun) by NAR (subscriber, #1313) [Link]

The time was when heavily-interlinked NFS systems could do that in the Unix world

This is very much present tense at my workplace...

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:57 UTC (Sun) by nix (subscriber, #2304) [Link]

'-o bg' solves most of that these days :)

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:55 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

The flash crash happens when you close a Flash animation (i.e. usually change/close a web site), not when you close a window.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 14:56 UTC (Fri) by ewan (subscriber, #5533) [Link]

And Adobe Flash does crash out my browser quite often.

As well as having a working libflashsupport Fedora also runs plugins inside nspluginwrapper, even on 32 bit installs. That means that flash runs isolated in its own process and doesn't take out your browser when it dies.

It is possible to get this right; if Ubuntu isn't doing it that's just an Ubuntu bug.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 15:02 UTC (Fri) by drag (subscriber, #31333) [Link]

Well this is Debian Lenny/Sid with epiphany-webkit.

Also the problem crops up in other applications that can support Mozilla plugins, like Liferea.

I will take a look at nspluginwrapper, though.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 6:35 UTC (Sat) by drag (subscriber, #31333) [Link]

Just for the record.

I installed and got PulseAudio working in Debian. And everybody above was right.. flash 9 doesn't work for shit. If I do the "export FLASH_FORCE_PULSEAUDIO=1" I can get it to sort of work for firefox, but not for anything else.

So I installed the Flash 10 beta and it seems to be much much better. Works happily with pulseaudio, but I had to disable framebuffer compression with the Intel driver stuff to get it to work without video corruption (of the flash video, not the display). This was in Debian Sid.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:53 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

Actually libflashsupport does not fully fix the problems Flash9 has with PA.

The libflashsupport interfacing in Flash9 is racy. Thus you might get a freeze or crash in some cases when you close a web site in your browser. All people that had a look on this problem agree that we cannot really work around this in our code -- it is Adobe's job to fix this. And you know what? Adobe actually did so, in Flash 10.

So, in summary: Flash 9 on PA is not perfectly stable, regardless how you run it. But the people to blame for that are not us, it's Adobe. Please point your fingers in the right direction when you complain!

(Running Flash 9 in libflashsupport in a plugin wrapper is the best way to run Flash on PA right now. That way crashing flash will not bring your entire browser down -- and if you ask me, allowing closed source software to run inside the browser process is a bad idea anyway.)

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 7:41 UTC (Mon) by NAR (subscriber, #1313) [Link]

But the people to blame for that are not us, it's Adobe. Please point your fingers in the right direction when you complain!

From the developer's point of view, you're right. From the user's point of view, you're wrong. Youtube used to work with Ubuntu Gutsy (I don't know about your browser, but it doesn't crash mine), doesn't work anymore after the Hardy upgrade (which introduced PulseAudio), so it's obviously PulseAudio is the one that broke my user experience.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 8:16 UTC (Mon) by ceplm (guest, #41334) [Link]

No, it's Ubuntu which did not make sure, that flash works.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:46 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

Priorizing support for broken closed-source software over a good, modern, well supported core sounds like a really really bad idea to me. If you really think that we should allow Adobe to take the Linux audio system hostage like this than please ... go away, use Windows.

Adobe's audio interfacing in Flash 9 was a big failure. But they fixed things in Flash 10 for us now. It's getting better.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 15:03 UTC (Fri) by smorovic (guest, #52892) [Link]

ALSA isn't forcing anyone to move to a new API. The old OSS v3 emulation is in kernel for all the older apps.

For solving pulse (but also alsa) oss compatibility problems, I have high hopes on this project:
http://lwn.net/Articles/296389/

If it can provide near-100% emulation it would be a silver bullet for PA voes, because many Alsa apps also support OSS.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 13:16 UTC (Sat) by cortana (subscriber, #24596) [Link]

The problem with the alsa-provided /dev/dsp is that programs using it don't get software mixing. So such programs will block off all sound for all other programs on the system; or if sound is already playing, they will fail to play sound themselves.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 20:43 UTC (Sat) by adamgundy (subscriber, #5418) [Link]

the 'cuse' OSS emulation smorivic linked to is an attempt to fix that disastrous mess..

as you said, if you use ALSA's built in OSS emulation, that's a kernel module, and competes with any other ALSA source for the sound device. most newer sound devices (consumer, at least) only support a single playback channel, and expect software mixing to be done... but dmix (ALSA's mixer) is a userspace mixer using the ALSA drivers, which means the ALSA-OSS kernel module can't use it, hence blocking audio (it also means no mixing is available for OSS users, so only one at a time can use OSS)

CUSE is a way of providing /dev/dsp and friends for OSS emulation from userspace, so of course a userspace OSS emulator can use dmix, pulseaudio, or whatever and play nicely with other apps.

CUSE lets a userspace program catch the full range of operations on the /dev/dsp device, so it can be a full emulation, unlike tools like aoss which hack into glibc's open calls and try to detect opens of /dev/dsp and do the emulation there (they can't catch everything, so some stuff works with aoss, other stuff doesn't).

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 8:27 UTC (Sun) by mlankhorst (subscriber, #52260) [Link]

Alsa already had oss support in the form of aoss, which hooks all ioctl's
and read/writes to /dev/dsp, so the only thing new is that this form uses
CUSE instead of hooking function calls with preloading.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:20 UTC (Sun) by adamgundy (subscriber, #5418) [Link]

the point is that aoss (and other variants) can't catch *everything* (mmap, etc), but I believe CUSE is intended to allow a user space process to catch all the same stuff as the kernel mode emulation can.

also, aoss is a pain for most users, because they have to start the program they want to 'emulate' under aoss, typically from a command line. CUSE just sits there as a daemon, waiting for any process to attempt to use /dev/dsp.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:03 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

PA has "padsp" which is similar to "aoss".

However, this kind of virtualization is always imperfect, won't work for more complex applications that use mmap, or make assumptions about timing and hardware that a software sound server cannot meet.

Doing emulation with LD_PRELOAD is always hacky, incomplete, and can only be a temporary solution that only works in very specific cases.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:07 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

CUSE is certainly a good thing. But it won't solve our problems entirely. OSS allows clients to read the playback pointer of the hw which apps use for synchronization. Now, in sound servers (and even on stuff like USB) we have a hard time to support this, since we usually have an extra latency "behind" the playback buffer. So, either the timing the application gets will be way off what actually is true -- or we will have to minimize that aforementioned extra latency at the cost of high CPU load.

So again. Regardless which approach we use: 100% compat won't happen.

LD_PRELOAD is good for some things, CUSE for others. But in the end, 100% OSS compat that works for everyone and always will stay a dream.

OSS as and API needs to go away.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:59 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

We will never be able to provide 100% OSS compat in PA -- and not even 100% ALSA compat.

Also, note that the OSS compat ALSA in the kernel provides is far from perfect. Doesn't support userspace plugins -- it hence is incompatible with userspace plugins, such as dmix, and everything else.

Also, OSS as an API is broken in the timing model. The timing model is so broken that it desn't really work properly on USB audio.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 22:35 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

I want to make clear that I (as the PA guy) don't consider ALSA a "mistake". It's good stuff actually. Of course, it has issues -- as every system has. But considering all the options it's not really that bad at all.

People like to construct some kind of opposition between ALSA and PA. They fail to see the big picture. ALSA is a building block PA builds on. And that's good. The ALSA and the PA devs work together. We're not in competition. If you think we are, you don't know the details.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 7:51 UTC (Sat) by jeremybar (subscriber, #49718) [Link]

I agree, I always disable the sound server and everything seems to start working :-)

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 20:55 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

I think your view on what PA is is a bit too simple. It's actually a little bit more than just some code that mixes audio for you. It allows you to switch streams on-the-fly between devices, does upmixing/downmixing (stereo to surround and so on), allows you to combine multiple sound cards into one, allows you to have seperate volumes for all streams you play, and so on, and so on, and so on.

The most interesting feature in my personal opinion however is the new 'glitch-free' core as described on my blog a while back: http://0pointer.de/blog/projects/pulse-glitch-free.html -- this is of course not directly visible to the user, but nonetheless a major feature of PA that you really cannot do without on a modern operating system.

Also, please note that Takashi Iwai maintains PA in OpenSUSE. Takashi is one of the two lead figures of ALSA. I think this little fact should make clear to you that even the ALSA people themselves believe in a need for a userspace sound system that goes beyond what ALSA provides.

The misconception that PA is little more than a replacement for dmix is popular, but nonetheless a .. misconception.

If you have trouble with PA's CPU load then try a different resampler. ALSA's internal resampler is very very primitive and sounds terrible. It uses little CPU, but is one of the weaker points in ALSA. Claiming that PA is a regression over ALSA in this area is probably simply caused by setting different priorities.

Lennart

Gotta say "I told ya so!"

Posted Sep 19, 2008 1:47 UTC (Fri) by jmorris42 (subscriber, #2203) [Link]

Last year when Pulse Audio went into Fedora 8 I posted this: http://lwn.net/Articles/256647/

Wherein I joked about the next shiny thing attracting their attention. It was called FUD. Note how it isn't a year later and we read an announcement about yet another audio API for everyone to port to is the works. This is insane. Is it any wonder every application that uses audio does it through an abstraction layer?

Gotta say "I told ya so!"

Posted Sep 19, 2008 3:37 UTC (Fri) by jmspeex (subscriber, #51639) [Link]

PulseAudio is still the way to go. The real problem is at the API level. That's what libsydney is trying to solve. It's just an API that would talk to PulseAudio -- or any other audio subsystem (there's already an imcomplete implementation for Windows and maybe MacOS).

Gotta say "I told ya so!"

Posted Sep 19, 2008 5:16 UTC (Fri) by bronson (subscriber, #4806) [Link]

What will libsydney provide that PortAudio doesn't?

Trying to find an answer, at the bottom of the libsydney link above triton asks, "As for the lib, wouldn't it be easier to fix PortAudio? Why reinvent yet another audio lib? NIH syndrome?"

Overall I'm really happy with Pulse but I'm very skeptical about libsydney... Does this world need yet another audio API for developers to choose from?

Gotta say "I told ya so!"

Posted Sep 20, 2008 0:14 UTC (Sat) by jmspeex (subscriber, #51639) [Link]

I remember there were a couple of issues, but I don't remember the details. I think low-latency support was one of these issues. libsydney is designed to be usable both by media players and by applications that require a very low delay (few ms).

Gotta say "I told ya so!"

Posted Sep 20, 2008 20:42 UTC (Sat) by salimma (subscriber, #34460) [Link]

Hope they come up with a good name for it too. libsydney and libcanberra are both very.. obsfuscating.

way [OT], but...

Posted Sep 25, 2008 2:46 UTC (Thu) by roelofs (subscriber, #2599) [Link]

jmspeex

Obscure Babylon 5 reference?

Greg

Gotta say "I told ya so!"

Posted Sep 21, 2008 23:20 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

The buffering model in PortAudio is too simple, too hardware-bound. A modern audio API should be a deparature of hw-specific fragment-based buffer metrics. Instead, we need to allow applications to set values that are actually understandable (such as "latency" and stuff) and default to a model that defaults large playback buffers with the option to rewrite them on request. Why? This will save us power (in conjunction with 'glitch-free' PA that is) and will greatly enhance network-transparent audio playback.

That is one of the more fundamental issues I don't think that PortAudio is the way to go. There are more. And it's a feeling shared by a couple of other audio related people.

Fixing the fact that we have to many competing audio APIs on Linux by adding another one is of course paradox. But still, after discussing this forth and back I believe this is the right way.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 6:00 UTC (Fri) by error27 (subscriber, #8346) [Link]

There is no mention that a lot of apps still use OSS and that's not going to change. OSS is apparently what they use on BSD and some people find it easier than ALSA. OSS emulation under Linux sort of craps things up a bit basically.

Potentially doing OSS emulation in userspace will fix that?

http://lkml.org/lkml/2008/8/28/270

Also the article talks about all the volume knobs being a problem but did anyone come up with a solution? Probably Linux sound is way way too buggy to fix in a one hour session... :/

LPC: Linux audio: it's a mess

Posted Sep 23, 2008 6:09 UTC (Tue) by angelortega (subscriber, #1306) [Link]

Programmers use OSS and not ALSA because the OSS API is *much* easier to
use than the ALSA library mess.

Also, regarding libsydney, creating yet another wrapper library is not the
solution. If you try to fix the problem of having 3 competing parts by
adding another one, you end up with 4 competing parts and no solution.

That's why we have Phonon in Qt/KDE

Posted Sep 19, 2008 6:50 UTC (Fri) by aleXXX (subscriber, #2742) [Link]

This article makes it very obvious why we decided to have our own
multimedia wrapper layer for Qt4/KDE4 apps, Phonon.
There is OSS, ALSA, GStreamer, libxine, MAS, NMM, JACK, now also
PulseAudio, and what is libsydney and portaudio ?
So instead to decide for any one of these now and then have to support it
for the years to come (no matter whether the actual developers are still
maintaining it or not), we created Phonon, which is *THE* portable
multimedia API (*NIX, OSX, Win) if you are writing a Qt4/KDE4
application.

Alex

That's why we have Phonon in Qt/KDE

Posted Sep 19, 2008 7:04 UTC (Fri) by Tuxie (subscriber, #47191) [Link]

libsydney and portaudio are wrappers like Phonon. In the same category there are also libao, SDL-audio and others.

What I miss most with PulseAudio is AC3/DTS/AAC passthrough. I hope that will be implemented soon. It's not too uncommon to have the computer connected to a receiver via SPDIF nowadays, and having it go through the AC3 -> decode -> PA -> ALSA AC3-encoder -> receiver is a lot worse CPU wise than just AC3 -> receiver when the system is already struggling with being able to decode the 1080p video stream in realtime.

That's why we have Phonon in Qt/KDE

Posted Sep 19, 2008 9:50 UTC (Fri) by Frej (subscriber, #4165) [Link]

Same issue here :)

That's why we have Phonon in Qt/KDE

Posted Sep 21, 2008 23:23 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

phonon is a very different wrapper than libsydney and portaudio. The latter two wrap PCM output, the former a media decoder with a couple of other things.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 8:15 UTC (Fri) by tialaramex (subscriber, #21167) [Link]

From my local informal sample, PulseAudio broke things for more than 50% of people, and "fixed" things, in the sense that sound began working properly when previously it hadn't, for less than 10%. Worse, it made diagnosing sound problems significantly harder, because it added another abstraction layer which has to be isolated in a divide-and-conquer strategy. Locally my solution was to disable PulseAudio and hope that I don't wake up one day to find that Lennart has persuaded Red Hat/ Fedora to disable the non-PulseAudio support in my audio apps.

Lennart should have concentrated on the part of the problem where he has relevant expertise, which is the network layer. If he'd built a fast and reliable framework for seamless network audio then people might entertain the idea that it's worth some disruption to get it onto everyone's desktops. But instead the argument seemed to be that Lennart would "fix" local audio (which was just coming together nicely with better ALSA drivers and dmix). He has comprehensively failed to do that.

Indeed Lennart has spent a LOT of time moaning at ALSA developers, often based on entirely false premises about how audio actually works on hardware people own, and he did not need to build yet another audio API to do that. If Red Hat had paid him to spend some time herding cats over at ALSA I'd be overjoyed, but instead this seems to have been merely a fortuitous side effect of the otherwise less than successful PulseAudio.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:40 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

You seem to know a lot of stuff about me. I am not particular fond of people who turn their dislike of software I wrote into personal accusations and judging in a public forum.

Also, what brings you to the conlusion that "Lennart has spent a LOT of time moaning at ALSA developers"? Sure, I have my issues with some parts in ALSA, and from time to time I let people know about them. But hey, nothing is perfect. Not ALSA. Not PA. In fact, I regularly defend ALSA -- far more often then criticising it. And I think I am pretty vocal about my support of ALSA. Just check the other comments I wrote to this article. ALSA is good stuff -- with some issues. More or less exactly like PA. Nothing is perfect -- and certainly software isn't either.

You are believing that ALSA and PA is in competition in some way. It is not. The ALSA devs and I work together. They already fixed quite a few of the issues I had with ALSA. That's collaboration. This is great, and I am happy. In fact at the conference this article was about Takashi and I met once again and we had very fruitful discussions.

You seam to be more of an ALSA fanatic that the ALSA guys themselves. Don't forget that Takashi even packages PA for OpenSUSE!

There is just no competition between ALSA and PA, since PA builds on ALSA. Everyone who thinks there was a competition is wrong, and doesn't get the big picture. Please get your facts right.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 18:19 UTC (Mon) by tialaramex (subscriber, #21167) [Link]

The conclusion that you'd spent a lot of time moaning about ALSA was based on reading ALSA's archives in the past and on observing that there is (was?) a lengthy list of alleged problems in ALSA on your PulseAudio site. As far as I know I've never met you in person, and so I have no opinions about you except as a software developer.

ALSA isn't perfect, that's true, I have several enhancement bugs open and I've had discussions with ALSA people about various aspects -- notably I wanted a way to determine which ALSA level ("volume") control, if any, is the most appropriate to tweak for a PCM stream I have open, thus making it possible to make good use of a hardware PCM stream mixer as provided in say an SBLive. The kernel drivers are also not perfect, the driver for my Intel chipset can't properly handle a running audio output during suspend for example. But PulseAudio doesn't (and shouldn't try to) fix that problem.

I am a JACK user, so I am not, as you seem to assume, wedded to ALSA as the solution to all problems. I do think that you underestimated how mature PulseAudio needed to be before it was a better option than the default dmix configuration present in e.g. Fedora prior to PulseAudio. Not so long ago I encountered an old friend who happens to be a Google engineer (I say this only to establish that he's a smart get-things-done guy) who'd basically given up using audio on his laptop because he upgraded Fedora and that installed PulseAudio and broke the sound. PulseAudio's promise is good, but as of the last version I tried (was forced to try) it doesn't deliver that promise for a lot of people yet.

I remain to be convinced about PulseAudio. But don't imagine that I can't be convinced. I wasn't very impressed with early versions of NetworkManager either, but it got better. Probably I'd be less soured on PulseAudio if it had been in Fedora as an optional extra, to get feedback from people who know what they're getting into and perhaps have good use cases (e.g. hotplugging audio devices).

LPC: Linux audio: it's a mess

Posted Sep 23, 2008 13:42 UTC (Tue) by mezcalero (subscriber, #45103) [Link]

The list of ALSA issues I maintain still exists and is regularly updated:

http://pulseaudio.org/wiki/AlsaIssues

Lennart

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 8:28 UTC (Fri) by jwoithe (subscriber, #10521) [Link]

"Apple and Microsoft each have a single sound server that does both desktop and pro audio"

This is not strictly true for windoze as I pointed out during the discussion at LPC. While "desktop" audio applications make do with the standard windoze audio system, professional audio interfaces all ship with ASIO/ASIO2 drivers for a variety of reasons - not the least of which was the inability of the standard windoze audio system to provide low latency performance (in fact, over the years there have been other APIs besides ASIO). Vista may be different in this respect but it has other audio-related problems.

MacOSX has CoreAudio which, on the face of it, does integrate the consumer and pro requirements under one API. However, digging deeper one realises that it is probably closer to the pro side of the equation, with added features allowing seemingly easy access for consumers. At least at first glance CoreAudio is closer to JACK than ALSA/PulseAudio.

"... but nobody at the session seemed to have much interest in that direction for Linux"

It's not that we're not interested - the ideal solution is to have one sound system to rule them all. The problem is that the requirements of desktop users and pro audio users are much more divergent than many people realise. Getting a single audio system to satisfy all usage constraints while still performing efficiently is very difficult. As an example, desktop/laptop users are primarily interested in lowering power consumption, but doing this while maintaining a low latency audio system
is problematic.

The other issue is that JACK is much more than just a route to audio hardware. It implements extensive and flexible inter-application audio and midi routing - something PulseAudio doesn't do simply because it's not a feature required by desktop users. JACK also gives synchronous operation of all connected applications and provides sample-accurate synchronisation, but again these features are not required by the average desktop users. Oh yes, network audio *does* matter to pro audio users - and for that we have netjack.

What this boils down to is that JACK and PulseAudio are working in different problem spaces. They are optimised for different situations, and trying to make one also provide the functionality of the other will only result in a system with suboptimal performance.

Thank you

Posted Sep 19, 2008 18:43 UTC (Fri) by dmarti (subscriber, #11625) [Link]

Thanks for the corrections -- does netjack have a low enough latency to work for recording?

Thank you

Posted Sep 19, 2008 21:43 UTC (Fri) by jebba (subscriber, #4439) [Link]

"Thanks for the corrections -- does netjack have a low enough latency to work for recording?"

Yes. You can also do things like have a remote machine just do the LADSPA filtering to distribute the load, for example. It works and is solid. I've done similar things with pulseaudio and it felt like I had just duct taped a boat together. Jack rules.

Also, netjack2 is under development with the next generation of jack, which is based on jackdmp.
http://trac.jackaudio.org/wiki/WalkThrough/User/NetJack2

I wish the netgods would all settle on jackd and figure out how to solve its power consumption issues, if at all possible. Use the best! If some of the features are overkill for the desktop, just don't expose them in the simpler GUIs...

-Jeff

Thank you

Posted Sep 20, 2008 7:24 UTC (Sat) by jwoithe (subscriber, #10521) [Link]

"I wish the netgods would all settle on jackd and figure out how to solve its power consumption issues, if at all possible."

Although completely untested, I wonder if simply specifying a very large period size for JACK would go some way to addressing this. At least some of the reason for the high resource usage of JACK is that currently most of its users require low latency and so run with very small buffers. Desktop users don't require low latency, so running large buffers wouldn't matter there. Larger buffers mean fewer wakeups and thus lower power consumption.

JACK is certainly not a replacement for anything else at present, but I do wonder whether with some work it could be used as the basic building block for a universal audio system.

The other thing worth reiterating here is that JACK does much more than just provide access to audio hardware. It is also a signal router allowing audio to be routed arbitarily between applications as well as to/from a physical soundcard. This, along with its low latency performance, is what makes it extremely useful to pro audio and music production users.

Thank you

Posted Sep 20, 2008 7:52 UTC (Sat) by jebba (subscriber, #4439) [Link]

"I do wonder whether with some work it could be used as the basic building block for a universal audio system."

As an example, I use it the base of http://freeeee.org a distro for eee pc.

Thank you

Posted Sep 21, 2008 23:55 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

Hehe, I like that. Is that a distro optimized for minimal battery lifetime of your Eeepc? Sounds like a great selling point, makes a lot of sense to me ;-)

Lennart

Thank you

Posted Sep 23, 2008 0:46 UTC (Tue) by jebba (subscriber, #4439) [Link]

"Is that a distro optimized for minimal battery lifetime of your Eeepc?"

There are two FREEEEE modes[1]: full-on media and low power ebook.

With jack, multiple mplayers, freej, firefox, chat, xournal, an offline copy of wikipedia, an image viewer, wifi, camera, etc all running it runs hot, yes. :) But it also runs all that fine without dropouts or crashing. jack performs reliably *now* and is never the weak link. Occasionally I have used pulseaudio as a jack client, which is convenient, but it is no where near as reliable as jack.

jack is clean, "feels" like a nice unix utility (it does its "one thing" extremely well), and has a committed upstream community.

-Jeff

[1]Second mode yet to be completed :)

Thank you

Posted Sep 21, 2008 23:54 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

It's not just about using 2s large buffers. If you use large buffers like that you also need to allow rewriting of them (which means "rewinding" of all filters in the system and so on). Because if you couldn't rewrite them pressing stop or start in your media playback would actually only result in what you hear 2s later.

Allowing rewrite of audio buffers is what JACK doesn't do. And adding this makes things very complicated. I did it for PA, but I simply see no reason for JACK to do it as well. Also, since JACK allows much more complex routing than PA, the rewriting buffer issue becomes equally more complex. i.e. every part that becomes part of a JACK pipeline would need to be able to "rewind" its state. That means every JACK client application would have to be patched non-trivially.

Lennart

Thank you

Posted Sep 21, 2008 23:48 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

PA now uses a "glitch-free" playback model which allows dynamic adjusting of the latency (and thus CPU load) to what the currently connected clients require without any discontinuities in audio. That's why PA is (or "can be" in a good setup) more power friendly than JACK is.

JACK is optimized for lowest latencies, CPU load comes second, and dynamic interruption-free changing of these parameters is not required. For PA however, we want dynamic latencies, generally low CPU load and the parameters need to be flexible.

But even if you would add "glitch-free" to JACK (which makes no real sense), it still is not designed for desktop use. PA carefully makes sure not to convert (or not even touch) audio data if not necessary. OTOH JACK only knows FLOAT samples, and thus audio type conversion is necessary usually at output and often enough on input too. That makes a lot sense for them. It's a very good choice, but it's not the right choice for the desktop.

The requirements of desktop/embedded use, and pro audio use are different. This results in different products. Trying to marry them is a great idea, but also very difficult to achieve. Of course, we should be trying to do this, but we have much bigger problems to fix before we start with that.

Lennart

Thank you

Posted Sep 22, 2008 7:24 UTC (Mon) by alankila (subscriber, #47141) [Link]

In fact with my brief tussle with JACK a year or two back, it seemed to me that the JACK itself might not actually mandate any particular sampling format. It's just that everybody talks float by default, because it is an excellent choice for professional audio work. (If there are any intents to support any other format, they aren't readily apparent.)

The buffers passed between applications were still void * pointers and the only clue for their contents was a simple string describing what kind of format you had requested from JACK, a #define which expanded to a string such as "32-bit pcm floating point audio"...

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 11:57 UTC (Fri) by jengelh (subscriber, #33263) [Link]

PulseAudio and libsydney are just another two things that diminished the chance of me not having to install all the gazillion different audio APIs to get all the end-user software running.

LPC: Linux audio: it's a mess

Posted Sep 21, 2008 23:58 UTC (Sun) by mezcalero (subscriber, #45103) [Link]

Have you heard of this awesome thing called "distributions"? It's really new and fancy and solves all the dependency issues for you! It's your distro's job to integrate PA. You shouldn't install it manually yourself.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:26 UTC (Mon) by jengelh (subscriber, #33263) [Link]

Who said something about dependencies? I mean the *amount* of different sound libraries.
`mplayer -ao help` gives an initial list: arts, esd, pulse, jack, openal. And ogg123 depends on libao. That already makes six!

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 1:03 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

So you are complaining that we have so many incomaptible sound systems and all apps have to support them all?

That's what libsydney is supposed to fix.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 2:26 UTC (Mon) by elanthis (subscriber, #6227) [Link]

Out of curiosity, what does libsydney fix that using the PA sink for ALSA
doesn't? Is it just the portability issue, or is there something more?

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 11:03 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

ALSA is an API for hardware devices. That's root to a lot of problems you encounter when using the PA plugin for libasound. There are certain differences between hardware and virtual devices. Timing works differently, memory management as well.

The devil lies in the details. It's a lot of small issues that make compat difficult. In addition, some fundamental parts, like the reliance on the classic period-based playback model is a weakness that libsydney does not expose. Not exposing periods will greatly increase compat with glitch-free and with network setups. Finally, libsydney allows clients to set all kinds of properties for a stream that make it especially useful for desktop sound servers. (i.e. setting icons for a stream, and so on...)

But there are more things.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 22:06 UTC (Mon) by lysse (guest, #3190) [Link]

...by introducing yet another library...? I can see a flaw in that :)

pa is fine?

Posted Sep 19, 2008 14:11 UTC (Fri) by wingo (subscriber, #26929) [Link]

So, I haven't had any problems with PA. Its big, rewritable buffers that it markets as "glitch-free playback" look great from a power-savings point of view, and compare favorably to what's available in windows these days.

It's not like Alsa is the bee's knees after all, it's fine as a hardware abstraction layer, but as an audio API it is unsatisfying. Better to write to a simpler API, like that offered by Jack or PA.

I also disagree with the suggestion that PA or GStreamer are equivalent choices for "[d]evelopers who want to play audio", as the otherwise-excellent author asserts. They are completely different and complimentary problem spaces, as anyone who has worked with either would realize.

LPC: Linux audio: it's a mess

Posted Sep 19, 2008 15:00 UTC (Fri) by nim-nim (subscriber, #34454) [Link]

> too much power to be acceptable to battery-life-watching desktop users

Please don't conflate desktop with laptop like PA's authors like to do. There is a large proportion of desktop users that do not use laptops (and use wired audio peripherals you absolutely do not want to mute at every possible occasion like PA likes to do).

PA is laptop-oriented. Laptop does not equal desktop.

laptop and desktop

Posted Sep 19, 2008 18:33 UTC (Fri) by dmarti (subscriber, #11625) [Link]

Non-laptop desktop machines are getting more power-aware, though, in order to get the "Energy Star" sticker needed to sell to government and big companies. So the nifty buffer-management features are going to be relevant to people who aren't on battery.

Jim Gettys mentioned this project: AudioFile : a network-transparent system for distributed audio applications. It look as if PulseAudio is picking up on AF's buffering model.

laptop and desktop

Posted Sep 20, 2008 13:41 UTC (Sat) by nim-nim (subscriber, #34454) [Link]

> Non-laptop desktop machines are getting more power-aware, though

Sure, and so are non-desktops. However making power and device hot-plugging core priorities, while at the same time dismissing multichannel sound and the ecosystem built around jack, show the "desktop" PA authors like to talk of is a laptop with simple stereo usb devices, with everything else retrofitted as an afterthought.

laptop and desktop

Posted Sep 22, 2008 0:01 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

Uh? PA supports surround sound just fine. Even does automatic upmixing/downmixing from/to stereo for you.

Also, I don't "dismiss" JACK and the community around it. It's great stuff. But solves a different problem as PA. That's all.

laptop and desktop

Posted Sep 23, 2008 5:04 UTC (Tue) by foom (subscriber, #14868) [Link]

> Uh? PA supports surround sound just fine. Even does automatic upmixing/downmixing
> from/to stereo for you.

Sure, but doesn't transparently support the most common source/sink of surround sound: AC3
DVD audio to AC3 over SPDIF output.

If PA automatically supported AC3 encoding of surround sound to my SPDIF output, without
having to manually configure an external encoder program, that'd be good.

If PA could allow routing of AC3 audio through to the SPDIF port without the decoding/encoding
overhead and resultant quality degredation, that'd be great.

But, if it could do that, *and* transparently fall-over to doing a decode/mix/encode when it
needs to mix another stream into the output (e.g. an alert sound or what have you), it would be
an absolutely amazing achievement: that's something that you just can't do today. And I'd really
like to be able to do that.

Also, volume control is an interesting trick when not decoding the ac3 stream locally; I dunno
much about it, but I'm led to understand there's some metadata in the frames which can be set to
affect the volume on the decoder. Maybe (hand-wave) that could be modified in-flight without
re-encoding the stream. :) If PA could do that, that'd be even another advantage over the
existing ALSA solution.

One can always dream...

laptop and desktop

Posted Sep 23, 2008 13:45 UTC (Tue) by mezcalero (subscriber, #45103) [Link]

PA focuses on PCM right now. Support for pass-thru codecs is on our TODO list.

However, please note that we will probably never support on-the-fly AC3 decoding or encoding due to patents.

Lennart

laptop and desktop

Posted Sep 23, 2008 14:52 UTC (Tue) by foom (subscriber, #14868) [Link]

Sad to hear that about the patents...but is there really any issue with using an AC3 encoder/decoder library if the user happens to have one installed already? (maybe they've already paid the licensing fee to Dolby, e.g. through buying a Dell with Ubuntu preinstalled).

laptop and desktop

Posted Sep 23, 2008 20:07 UTC (Tue) by mezcalero (subscriber, #45103) [Link]

I am happy to merge patches for that.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:04 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

Dunno how you come to the conclusion that PA is laptop oriented. In fact I do all of my development on PA on a desktop machine. And that's where I run it on most of the time.

Sure power saving is not as important for desktops as it is for laptops, but power saving is not all what PA is about. And power saving is good for desktops too .. and the environment. (and also for your power bill)

Lennart

+1 for no userspace daemon

Posted Sep 19, 2008 23:54 UTC (Fri) by literfizzer (subscriber, #31274) [Link]

I'm another who doesn't want to have to run a userspace daemon just to get sound. my experience with such daemons (arts, esd) has been that they add no value to me and introduce headaches like A/V sync problems.

All I want out of the sound system on my Linux box is to be able to reliably play sound with a minimum of overhead and fuss, without delays, skips, or sync problems. i don't want one or more competing userspace daemons making my life unnecessarily complicated. If some web page starts playing crud while I'm listening to music, I'll mute, pause or close it.

+1 for no userspace daemon

Posted Sep 20, 2008 10:20 UTC (Sat) by DonDiego (subscriber, #24141) [Link]

I must say I have to agree. All those daemons solve a ton of problems I never experienced. Well, I guess I'm just old school...

+1 for no userspace daemon

Posted Sep 20, 2008 12:49 UTC (Sat) by nix (subscriber, #2304) [Link]

arts and esd, well, skipping isn't a good description of the problems *I*
encountered with those. It was more 'no sound at all because it
spontaneously crashed' for arts, and 'no sound at all because of a
ludicrous and dysfunctional nonauthentication system' for esd.

Thankfully pulseaudio isn't a pile of crap like arts or esd and so doesn't
have any of those problems that I can see. :)

(Among other things PA runs at realtime priority so that it won't skip
just because of CPU load: if you're swapping so hard that its being
swapped out is problematic, I doubt that whatever was playing the music is
going to be in memory much longer either.)

+1 for no userspace daemon

Posted Sep 22, 2008 0:08 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

Whether audio is handled by a userspace daemon or not is just a technical detail. It shouldn't have any influence on the user interface.

You learned to accept that X11 runs in userspace. PA does for audio was X does for displays. So why is it so difficult to accept PA for audio too?

Also, MS is proud of Vista's userspace sound system. And Apple about CoreAudio. So, we now do something similar on Linux, for similar reasons. That's all.

Lennart

clue, meet literfizzer

Posted Sep 22, 2008 2:47 UTC (Mon) by elanthis (subscriber, #6227) [Link]

You're being contradictory. You want less crap competing for sound output
priority and want glitch-free playback... so you want to go back to a bunch
of applications doing piss-poor direct access to ALSA and/or using the
crappy low-quality dmix mixer that can pretty easily introduce stuttering
and glitches under even remotely heavy load.

You act as if audio on Linux was all peachy and perfect until PA came
along. That's bullshit. It's been horrendous and barely usable all along.
It worked in some situations with moderate quality if you were lucky and
had the right drivers and only used the right applications. PA glitch-free
playback combined with quality mixing offers to finally make audio bearable
for the first time for _everyone_. Once the bugs are worked out of the
audio drivers, applications, and PA itself.

It's a hell of a lot closer to being realized that acceptable graphics
driver or wireless behavior on Linux, I'll tell you that much. In 10 years
of using Linux on mainstream hardware I've yet to have a video card that
works worth a crap without binary blobs, and this laptop here is the first
one I've ever been able to get wireless to work... and it requires hoops to
be jumped through (Broadcom). The problems people are having with audio
are _nothing_ in comparison, but I don't see you bitching about the
constant work being put into X/DRI or the wireless drivers and telling the
devs that they should switch back to the old way (what, plain VESA?
nothing?) because the new stuff is still buggy and incomplete (I don't even
want to get into how much time I spend trying to get video to work on both
old and new video cards -- and sometimes monitors -- compared to audio).

You don't and won't get that without a solid mixing daemon. Keep in
mind that kernel-space can't do floating point math and that reality is not
so great for the quality and performance of sound remixing and resampling.

Let me break this down for you. The software has bugs (that software being
PA, the ALSA drivers, and your user-space applications -- all of them, not
just PA), you have hit the bugs, and you're just short-fused enough to
think that said bugs invalidate the entire model. Wise people instead just
think that the bugs need to be fixed so we can solve the problems that the
old model has already proven incapable of solving.

Maybe PA got pushed into Fedora a little too early. I won't disagree with
that. I had a lot of pain with Fedora 8, too.

Do NOT be stupid and think that "Fedora pushed it too early" is the same as
"the whole model is broken."

+1 for that rant

Posted Sep 22, 2008 5:29 UTC (Mon) by bronson (subscriber, #4806) [Link]

Well said!

It's been said that both Fedora and Ubuntu pushed PA too early. I admit, given how bad Hardy has been, this is a compelling point of view. However, if the distros had not pushed early, the great number of audio bugs that have been fixed in the past six months would probably still exist. I bet the distros could wait 3 more years before pushing and it would still be a horrible experience. Software integration sucks, the only way to shake out the bugs is to integrate and see what breaks.

So, between Phonon and PulseAudio, hopefully audio will work decently in FC10 and Intrepid.

clue, meet literfizzer

Posted Sep 22, 2008 7:10 UTC (Mon) by nix (subscriber, #2304) [Link]

Glitch-free for *everyone*, except those for whom PA refuses to talk to
ALSA (*waves*) or refuses to use the high-def timers that definitely exist
(*waves*).

I really must diagnose this, but trying to find bugs in ALSA's kernel
component is like trying to shovel sewage with a toothbrush. Its
kernel/user interfaces are horrible (and most of both components is
entirely undocumented as far as I can tell: I still have no clue what the
Lisp interpreter is doing in there, for instance. Maybe the docs are just
hiding. Maybe they appeared since the last time I looked at this a couple
of months ago. But, y'know, life's too short: PA mostly works for me now,
and hardly ever glitches even with glitch-free mostly turned off by the
enforced usage of the OSS compatibility layer.)

+1 for no userspace daemon

Posted Sep 22, 2008 22:12 UTC (Mon) by lysse (guest, #3190) [Link]

Same here... with the possible exception of jack - but even then, I can't help but wonder whether the Unix "everything is an entirely passive file" is quite the thing for audio. No problem at all with "everything is a file", but why not say "every file can be opened with either a passive interface or a stream/callback interface"? Wouldn't make much difference for files which are blocks of disk, true, which is why having both is nice - but the stream/callback paradigm is more or less perfect for pipes; after all, it's more or less how they work anyway... and since that's the model jackd presents to its applications, wouldn't the need for it would more or less go away altogether?

+1 for no userspace daemon

Posted Sep 22, 2008 23:22 UTC (Mon) by nix (subscriber, #2304) [Link]

A 'stream/callback interface' can be modelled as a pair of files, one of
which is a source of data and one a sink, or as a single file which you
can read() responses from as you write() stuff to it. Anything you can do
with a TCP/IP socket you can do with a suitably backed file, and we know
very well that you can define protocols over network sockets :)

+1 for no userspace daemon

Posted Sep 23, 2008 6:35 UTC (Tue) by lysse (guest, #3190) [Link]

That just shows that the two models are equivalent in expressive power, like inheritance and delegation. If we couldn't synthesise one from the other, JACK couldn't exist - but it'd still be nicest of all to have both available at the lowest level :)

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 6:44 UTC (Sat) by zlynx (subscriber, #2285) [Link]

One spiffy way to fix the HD-Audio detection problem would be to ask the user to connect a stereo microphone. Then the software could poke sound through various pins until it hears the correct results at the right levels through the mic.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 11:29 UTC (Sat) by Tuxie (subscriber, #47191) [Link]

Heh, that's probably a bit too much to as from the user. Personally I know very few persons who even own a microphone, let alone a stereo one...

There could be a simple "audio calibration" program in the system settings though. Play a few pure sine waves of various frequences and let the user raise the volume until it starts to distort. Then for line-in/microphone they could experiment with different input volume levels. The values chosen would then be used as "100% volume" in the rest of the system.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:10 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

That's not exactly the definition of "plug'n'play"...

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 4:10 UTC (Mon) by zlynx (subscriber, #2285) [Link]

It seems to me that the only way to get real "plug & play" is full cooperation from each and every manufacturer. You'd need a pin map to know what outputs and inputs are connected to which pins.

If you can get that then of course its a better idea.

LPC: Linux audio: it's a mess

Posted Sep 20, 2008 9:47 UTC (Sat) by magnus (subscriber, #34778) [Link]

PulseAudio seems like a well thought-out project and I have nothing against it, but every audio API "solution" adds another layer to the Linux audio mess.

The only way a standard application audio API could ever be created is if someone writes an abstraction layer that can talk to Alsa, OSS, Jack, PulseAudio, ESD, aRts, MAS, etc, with autodetection of which one to use and dlopen:ing all relevant libraries to avoid broken dependencies. This code should then be included in source form in every program that uses it, because making a shared library out of it would just make it just another API that needs to be supported.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 0:12 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

libsydney is supposed to be one part of that API you ask for.

Please note however that the audio/mm stack we are heading too is far less complex than you suggest. And the depth of the stack is not more than 3 (i.e. Gst on PA on ALSA at most)

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 13:23 UTC (Mon) by nhippi (subscriber, #34640) [Link]

That's ignoring at least phonon and jack, that will continue to exist. Also, how does libsydney fall in? And the embedded people will have openmax...

Still, a good start would be a aggressive purge of legacy solutions - arts, esd, mas, various parts of libalsa.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 17:09 UTC (Mon) by drag (subscriber, #31333) [Link]

Jack is solving a different problem domain. There is very very significant differences and requirements between people wanting to do real time audio editing vs multimedia playback.

Plus it's possible to play regular applications through PA and into Jack, if you want to use a familiar application or media source.

> Still, a good start would be a aggressive purge of legacy solutions - arts, esd, mas, various parts of libalsa.

Ya. Unfortunately that requires rewriting applications that depend heavily on those API.

For very modern-style applications that use things like Phonon or Gstreamer, this shouldn't be a big problem since those things are designed to be modular. Arts is being taken care of through the KDE folks, who are forced to rewrite their applications to work with QT4 anyways. Mas isn't popular. PA is compatible with Esd.

Then there is OpenAL, SDL, and numerous other much more minor sound APIs.

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 1:09 UTC (Mon) by mezcalero (subscriber, #45103) [Link]

I would have preferred not to quote me with "It's a mess" in the title of the article. Makes me feel slightly comfortable, because not everything is that bad in audio on Linux, and this might give a too negative view of things and the work of quite a few people. It's the words I chose during my talk, but seeing it all over the 'net like this makes me wish I had chosen different words ;-)

Anyway, very informative article!

BTW: Adobe fixed the issues we pointed out to them in Flash 10. Flash 10 works fine with PA. Though it still sucks that Flash 10 is closed source and it takes so long to get things fixed due to that -- they do listen to us.

Lennart

LPC: Linux audio: it's a mess

Posted Sep 22, 2008 10:26 UTC (Mon) by alankila (subscriber, #47141) [Link]

One thing missing from this discussion is a piece of gratitude: I think Pulseaudio at least is on the right track in that it makes it simple to produce output.

With ALSA, you write this 100+ lines long method with about a dozen calls, each which can fail, some which are necessary before other calls in the sequence will work. It is strangely low level, maddeningly flexible, and poorly documented, so it's very difficult to know what you should do.

At least these questions may arise: What is the sufficient condition to get the output mode that is exact to the spec that you requested? Are there format conversions or resampling in the output path? If resampling, where do I set a reasonable algorithm, as pick-nearest or linear interpolation is not acceptable for audio work? What is the minimal program that produces output and yet has some specified latency? How do you write a working full duplex program?

LPC: Linux audio: it's a mess

Posted Sep 24, 2008 20:56 UTC (Wed) by mezcalero (subscriber, #45103) [Link]

At Plumbers one thing became very clear: someone needed to sit down and write up a guide through the jungle of Linux Audio APIs because we have so many and so many are incompatible. And so I did it:

http://0pointer.de/blog/projects/guide-to-sound-apis.html

LPC: Linux audio: it's a mess

Posted Sep 25, 2008 5:04 UTC (Thu) by graydon (subscriber, #5009) [Link]

I'd wager most of us non-sound-expert users just want a single volume control that makes sound louder or quieter, that we can always find, and that always works. No mysteriously silent apps, and no configuration save for the volume buttons physically on the computer.

For as long as I can remember, linux has over-engineered and under-delivered audio. It'd be a pleasant surprise to change that habit. But I'm not holding my breath.

LPC: Linux audio: it's a mess

Posted Oct 5, 2008 14:57 UTC (Sun) by Joes (guest, #54524) [Link]

Why pulse is not ready for prime time, and probably never will be.

A network capable sound daemon does not belong in the mass of machines that have sound cards. This just introduces bugs and eats memory and cpu.

A configured audio stations with Jack and ALSA outperform pulse in every instance.
Separate volume controls
No glitches in sound
Multiple apps recording and playing simultaneously

Plus Low latency!

Every now and then a bad idea like this gets more marketing than engineering.
Just look at mono. hmmm...
Mono gets you a job with microsoft.
Pulse gets you a job with redhat.

Now I understand why it was written.
Pulse programmers may be skilled but there's been many a skilled programmer that has written "great" code that causes more problems that it solves and like mono, encourages others to go down a bad path.

And if you take offense that I'm bashing pulse programmers, well I now have to spend a half hour or more every time I install Fedora, SUSE, Mandriva or Ubuntu to get decent sound working. That wasn't the case before pulse got shoved into the distros prematurely.

LPC: Linux audio: it's a mess

Posted Oct 5, 2008 17:29 UTC (Sun) by nix (subscriber, #2304) [Link]

It's not that you're bashing Lennart, it's more that you're basing your
bashing on content-free innuendo. *Why* doesn't network-capability belong
in 'the mass of machines'? Why does the pulseaudio native network protocol
*plugin* 'introduce[] bugs and eat[] memory and cpu'? Got any evidence? I
doubt it, because there is none. (I mean, yes, obviously deserializing
data flowing over the network costs a bit of CPU time, but it's utterly
insignificant, and you don't even pay that cost unless you're doing
network streaming.)

I don't even know what your criteria for measuring 'performance' are,
given that your examples are matched by pulseaudio (given hardware that
supports hi-res timers, at least).

Personally I use Pulseaudio for one reason: *API compatibility*.
Pulseaudio can emulate virtually everything (other than Jack) so I don't
need to run a mass of battling incompatible sound daemons anymore: I can
just run one. (And yes, it *can* mix stuff from multiple sources and sinks
freely.)

The rest of your post is sub-Slashdot ad hominems that aren't worth
responding to.

Aug	SEP	Oct
	01
2008	2009	2010