SDIF: Sound Description Interchange Format
Effective musical application of the many sound analysis/synthesis tools
now available has been hampered by the absence of a common representation
for analyzed data. Although standard file formats abound for time-domain
audio samples, each institution has its own file formats for other kinds
of sound descriptions.
Sound descriptions supported by SDIF include
SDIF is optimized both for streaming and for archiving. The SDIF format
is a sequence of time-tagged frames (IFF chunks), appearing in chronological
order, with multiple kinds of frames allowed in a single file or stream.
Thus, a single SDIF file might contain STFT results, a pitch estimate envelope,
and sinusoidal tracks from the same original sound. A library of standard
frame types defines formats for storing the common sound representations
listed above.
Frames consist of some number of 2D matrices of floating point numbers,
with each column corresponding to a parameter like frequency or amplitude
and each row representing an object like a filter, sinusoid, or noise band.
A few optional chunk types can appear at the beginning of an SDIF file
or stream, e.g., file creation info and one-time global information needed
to configure synthesis software to interpret the chunk data.
This effort was suggested by Xavier Rodet of IRCAM. Discussions with
scholars and vendors at the 1995 ICMC confirmed the need for this standard
in the academic and commercial communities . Adrian Freed coined the name
and invited ICMC delegates to participate in the development work. Discussions
over the months led to new requirements for the standard particularly in
the area of internet applications. Also, although originally conceived of
as a portable (and not necessarily efficient) interchange format, CNMAT
has decided to use it as the primary working format for its sound analysis
and synthesis tools. This has resulted in a standard grounded in real needs
and field tested.
CNMAT and IRCAM are nearing consensus on the final details of the format,
and expect to publish a specification in the next few months. Already there
is a library for reading and writing SDIF, and a growing set of tools for
manipulating SDIF files.
We encourage feedback from all interested uses of SDIF.
After we standardize the SDIF formats for sound representations now in
common practice, we expect to experiment with the following topics and eventually
add them to the standard: