This article was first published on GamaSutra by Theo Nogueira on 7/19/19.
On our current day and age of game development, almost all AAA game studios use audio middleware and so does a good amount of indie game developers. However, a lot of indie developers wonder what is it, where does it come from and why do they need it as a part of their game?
What will you find in this blog:
– What & Why?
– How can #middleware improve my game?
– Multiplying & Diversifying – Footsteps
– Making the Ambience More Realistic
– Complex and Long Dialog Script
– Other Important Factors
– And what about Adaptive Music?
– Quick note on VR/AR/MR and 360 audio
Some will say:
1.”Unity has a really good audio tool and I don’t need more than that,” or
2.”I can make all that by scripting it for you,” or
3.”It’s too expensive and I don’t exactly know how much I’ll pay to have it in my game,” or
4.”It’s very complicated and my programmers and designers will not want to learn it!”
Well, first, lets demystify some of that:
1. Game Engines have simple audio tools that are sufficient for certain games, but they are not enough anymore for the level of audio quality and implementation that most games require, whether they are Indie or #AAA.
2. Yes, programmers could definitely create a whole new middleware application specific for your game. However, your sound designer will have to learn how to use it, deal with it’s bugs and it could require a lot of communication between your programmers and sound designers. That takes time, something most of us cannot afford. The amount of time spent is not worth it, when your sound designer/composer can use a tool they are already comfortable with. That will get the job done giving the programmers only simple parameters and a few scripts, which will make the audio run exactly as planned. There is copious support for the top audio middleware solutions, and the game audio community also provides help and support for these tools. Custom solutions do not have this underappreciated benefit.
3. Yes, you may have to for the usage of middleware if your game is very successful or has a large development budget. I’ll be breaking down the prices below, and most middleware has a free option as well. In most cases, the cost is justified and reasonable, and the benefits far outweigh those costs. In many cases, middleware can actually reduce the costs since it does a lot of the heavy lifting and can save tons of engineering time.
4. As for being complicated, your programmers will not have to learn how to use the middleware itself. Usually, calling the middleware to play a sound works the same way you would play a sound in the engine, so the amount of additional work is often minimal. The #SoundDesigner / #Composer job is to make the implementation process through #Middleware easier for the programmer, who in some cases will only have to write a few audio scripts to get the middleware connected to the game engine and the rest was already done inside the middleware. It is called middleware because it sits in between the programmer and the sound designer/composer and allows each to do their function to the maximum.
So, through this article, I want to show you why should you consider audio middleware for your game and how it can improve the usability of audio and make transitions very smooth.
To start, I’d like to point out the most used audio middleware on the Market:
And below you can see a simple comparison between them. However, please consult each companies respective website for the latest prices and terms. For example WWISE also has a free option based on the total number of assets in the project.
All #middleware have their pros and cons, however, for the purpose of this article, I am going to talk more specifically about #FMOD and #Wwise. #Fabric is not as used nowadays and I personally have no experience with #Criware, which is mostly used by big studios Game companies in Asia, such as #Capcom. And as for #Elias, I wrote two blogs about the software, which I really enjoy, however, as I want to focus on both sound design and adaptive music on this blog, #Elias would only fit one side and you can check my other blogs “From Linear to Adaptive – How a Film Composer Fell in Love with Adaptive Music” and “From Linear to Adaptive – A Deeper look into Elias Studio 3 MIDI Capabilities” to find more about this amazing tool.
How Can #Middleware Improve My Game?
There are many ways that middleware can improve your game. The first one is when it comes to creating organic sounds, for example a vehicle engine. On the example below, I created a space racer engine using FMOD and Wwise. In FMOD, you can see I used 2 different parameters, RPM and Load. RPM will show the speed of the racer as it changes gears (I know, why would a space vehicle change gears? Because it sounds cool!), and Load tells if the vehicle is accelerating or decelerating.
Disclaimer Note: On the videos below, you will not hear me talking. These are just simple sessions to demonstrate what you can accomplish so easily and accurately on middleware. The sessions were created specifically for this blog, so there’s not many complicated features being used as they would make it harder to understand the point. However, Middleware sessions tend to contain all of their audio elements together and use a lot of events, tracks and plugins to achieve their goal.
While in FMOD you create the audio tracks still on a linear-format thought-process (although it is still non-linear), in Wwise, there is a lot more freedom when choosing how an audio file will react to different switches and states.
Another way that audio middleware can improve your game is to improve loops and sounds that need to be repeated, so they never sound exactly the same and your player never gets tired of them. Some examples relate to footsteps, ambiences, and a way to randomize vocal sounds and dialog. Below you can see an example from a Wwise session where I created footsteps sounds on three different materials. There are only four footsteps sounds for material, but because of randomization of pitch, order and effects, they sound like there are thirty footsteps recorded, while you save space on your data budget and store only four sounds of each. This can also be accomplished with only two or even one sound of each.
Ambiences can be tricky, mostly because players tend to spend a good amount of time in a single location. So, to avoid a simple loop of a one-minute countryside morning and hearing the same birds over and over, we turn to middleware. By using them, you can have multiple base ambiences of a couple minutes and add multiple insects, birds and other nature sounds in a random order. Doing that, you are able to create a much more realistic-sounding location. And moreover, you can use middleware to smoothly change the time of day and transition between locations with a correct fade curve.
Below you can hear an example of how I created a countryside ambience and make it go from day to night and from no-rain to heavy rain with thunders. Notice that the birds in the morning do not repeat often and their tracks play randomly, making it harder for the player to identify a looping-point. The thunders follow the same suit, never playing the same track the same way and always playing on random moments.
All middleware are better implementers than the game engines when it comes to dialog. If you think about the fact that you might have twenty to fifty characters, each with over ten lines in twenty different languages and you still need all of the audio to play the correct way and perfect timing, with the same effects. With middleware you can organize them all correctly and make sure they have the same effects. Another important use of middleware is if you utilize multiple takes, or lines that can be used for the same goal, you can group them and either call them randomly, or choose an order. Check the very simple dialog demo on FMOD below.
Lets say you have a character that goes inside a cave. The reverberation within that cave should completely change the character’s dialog, the sound effects and the ambience. However, unless you want to, usually the music should stay the same. In a game engine, you would probably need to program the reverb and delay to react on every single one of those tracks. However, in middleware, you only have to create a state in which all of the sounds you choose will change unnoticed. With middleware, you can change the sounds on the go and they will automatically change within the game, without any issues in a matter of minutes.
FMOD and Wwise have capabilities of connection with other software and external plugins that allow audio artists to create exactly the sound they, together with the game developers envision.
I talked extensively about Adaptive music on my blogs about Elias, which can check out through these links, if you have not yet read them:
Music in Films and TV Series are cut to fit the scenes and to hit certain points organically. However, in a game, it is very hard to transition a track into the perfect hit point without making it fully adaptive. And to do that, you MUST use audio middleware. In the demo below I created two moments: Scene 1 and Battle. Scene one has no extra layers, but one of its tracks randomly chooses synths that complements its top line. It also has an intro that will only play once and go into the Scene cue, which will loop until a call for a change.
As for the Battle cue, it has both an intro and an outro. It also has four layers. To control those layers, I created a parameter called Battle Level from 1 to 4. Lastly, the MusicState parameter controls the moment where the player is located. If the player is on Scene 1, the Music State is at 1, however, if the player is in combat, the Music State is at 2, triggering the Battle Sequence. The battle levels can be used in multiple ways, such as the danger the player is in, or how close to the end of the battle the player is in.
When it comes to VR/AR/MR, middleware is also on it. There are plugins constantly being developed that can fulfill the spatial audio needs within games through middleware. The most commonly used at the time of this blog writing is the Google VR plugin, which allows you to correctly place sounds within the field for VR.
Audio middleware is a tool that can improve your game immensely, whether it is for a AAA game or a simple Tetris indie remake. And besides all the points I made above, middleware can push audio data budget down by a lot. Both FMOD and Wwise have the capability of compress-exporting it accordingly to fit that budget, by choosing the sound format, its bit rate and sample rate. Moreover, through middleware you can add effects and randomization to all kinds of sounds, without the need of various recordings of the same sound, which in return also brings down the audio data budget.
Thus, dear game designer and developer friend, I hope this can shine some light on your understanding why your sound designer or composer is asking you to use one of these in your game. They see the value of it and understand the importance of a great soundtrack to make your game sound perfect.
If you have any questions, I would be more than glad to answer them. I am very well versed in both sound design and music and I have full knowledge of FMOD and Wwise. You can contact me via email to email@example.com