User:Password:

History: Underdetermined speech and music mixtures

Comparing version 22 with version 33

@@ -Lines: 1-12 changed to +Lines: 1-15 @@

!::Underdetermined-speech and music mixtures::

- We propose to repeat the [http://sisec2011.wiki.irisa.fr/tiki-index.php?page=Underdetermined+speech+and+music+mixtures|underdetermined-speech and music mixtures] task in SiSEC2011.

+ We propose to repeat the [http://sisec2011.wiki.irisa.fr/tiki-index.php?page=Underdetermined+speech+and+music+mixtures|underdetermined-speech and music mixtures task in SiSEC2011].

- !! Test data

+ !Results />Results for development sets: [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_dev1_all.html|dev1], [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_dev2_all.html|dev2], [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_dev3_all.html|dev3].
Results for test sets: [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_test_all.html|test], [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_test2_all.html|test2], [http://www.onn.nii.ac.jp/sisec13/evaluation_result/UND/underdetermined_test3_all.html|test3].

+ !! Test data

We have three datasets:

- __Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test.zip|test.zip] (22 MB)__　(former test data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008].)
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test2.zip|test2.zip] (16 MB)__ (former test data of [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010].)
__Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/test3.zip|test3.zip] (8.6MB)__(~~red:fresh~~ data for SiSEC2011. This is the 3-ch mixtures of 4 speech sources.)

+ __Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test.zip|test.zip] (22 MB)__　(test data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008].)
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/test2.zip|test2.zip] (16 MB)__ (test data of [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010].)
__Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/test3.zip|test3.zip] (8.6MB)__(test data of [http://sisec2011.wiki.irisa.fr/tiki-index.php?page=Underdetermined%20speech%20and%20music%20mixtures|SiSEC2011]. This is the 3-ch mixtures of 4 speech sources.)

!!!test.zip

@@ -Lines: 14-18 changed to +Lines: 17-21 @@

*__instantaneous mixtures__ (static sources scaled by positive gains)
*__live recordings__ (static sources played through loudspeakers in a meeting room, recorded one at a time by a pair of omnidirectional microphones and subsequently added together)

- **__CAUTION__: For SiSEC2011, we will ~~red:NOT~~ evaluate "__synthetic convolutive mixtures__" (static sources filtered by synthetic room impulse responses simulating a pair of omnidirectional microphones via the Roomsim toolbox).

+ **__CAUTION__: For SiSEC2013, we will ~~red:NOT~~ evaluate "__synthetic convolutive mixtures__" (static sources filtered by synthetic room impulse responses simulating a pair of omnidirectional microphones via the Roomsim toolbox).

The room dimensions are the same for synthetic convolutive mixtures and live recordings (4.45 x 3.55 x 2.5 m). The reverberation time is set to either 130 ms or 250 ms and the distance between the two microphones to either 5 cm or 1 m, resulting in 9 mixing conditions overall.

@@ -Lines: 51-55 changed to +Lines: 54-57 @@

__Licensing Issue:__ These files are made available under the terms of the Creative Commons [http://creativecommons.org/licenses/by-nc-sa/2.0/|Attribution-NonCommercial-ShareAlike 2.0] license. The authors are Shannon Hurley, Nine Inch Nails, AlexQ (Alexander Lozupone), Mokamed, Carl Leth and Jim's Big Ego for music source signals and Hiroshi Sawada for mixture signals.

!!!test3.zip

@@ -Lines: 67-78 changed to +Lines: 69-78 @@

!! Development data

__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/dev1.zip|dev1.zip] (91 MB)__
__Download [http://www.irisa.fr/metiss/SiSEC10/underdetermined/dev2.zip|dev2.zip] (47 MB)__

- (Both are the former development data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008] and [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010]) />__Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/dev3.zip|dev3.zip] (47 MB)__ (~~red:Fresh~~ development data for 3-ch mixtures.)

+ __Download [http://www.irisa.fr/metiss/SiSEC11/underdetermined/dev3.zip|dev3.zip] (47 MB)__
(The former development data of [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008], [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010] and [http://sisec2011.wiki.irisa.fr/tiki-index.php?page=Underdetermined%20speech%20and%20music%20mixtures|SiSEC2011].)

The data consist of Matlab MAT-files and WAV audio files, that can be imported in Matlab using the commands load and wavread respectively. These files are named as follows:

@@ -Lines: 86-90 changed to +Lines: 86-89 @@

where <srcset> is a shortcut for the set of source signals, <mixtype> for a shortcut for the mixture type, <reverb> the reverberation time, <spacing> the microphone spacing and <j> the source index.

All mixture signals and source image signals have 10s duration. Music source signals have 11s duration to avoid border effects within convolutive mixtures. The last 10s are then selected once the mixing system has been applied.

@@ -Lines: 99-120 changed to +Lines: 98-114 @@

*dev3_<srcset>_<mixtype>_<reverb>_<spacing>_sim_<j>.wav: stereo contribution of a source signal to the two mixture channels
*dev3_<srcset>_<mixtype>_<reverb>_<spacing>_mix.wav: stereo mixture signal

__Licensing issue: __ These files are made available under the terms of the Creative Commons [http://creativecommons.org/licenses/by-nc-sa/2.0/|Attribution-NonCommercial-ShareAlike 2.0] license. The authors are Another Dreamer and Alex Q for music source signals and Hiroshi Sawada, Shoko Araki and Emmanuel Vincent for mixture signals.

!! Tasks

- The source separation problem has been split into four tasks:

+ The source separation problem has been split into three tasks:

## __source counting__ (estimate the number of sources)
## __source signal estimation__ (estimate the mono source signals)
## __source spatial image estimation__ (estimate the stereo contribution of each source to the two mixture channels)

!! Submissions

Each participant is asked to submit the results of his/her algorithm for tasks 2 and/or 3
* over all or part of "test", "test2" and "test3".

- * over all or part of "dev2", if his/her algorithm was not previously submitted to the [http://www.irisa.fr/metiss/SASSEC07/|Stereo Audio Source Separation Evaluation Campaign] nor [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008], so as to assess improvements compared to that campaign.
*and all or part of "dev3".

+ * over all or part of "dev2" and "dev3", if his/her algorithm was not previously submitted to the [http://www.irisa.fr/metiss/SASSEC07/|Stereo Audio Source Separation Evaluation Campaign], [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|SiSEC2008], [http://sisec2010.wiki.irisa.fr/tiki-index.php?page=Underdetermined-+speech+and+music+mixtures|SiSEC2010] nor [http://sisec2011.wiki.irisa.fr/tiki-index.php?page=Underdetermined%20speech%20and%20music%20mixtures|SiSEC2011], so as to assess improvements compared to those campaigns.

The results for task 1 may also be submitted.

@@ -Lines: 138-149 changed to +Lines: 132-140 @@

Note that the submitted audio files will be made available on a website under the terms of the Creative Commons [http://creativecommons.org/licenses/by-nc-sa/2.0/|Attribution-NonCommercial-ShareAlike 2.0] license.

!! Reference software

Please refer the [http://sisec2008.wiki.irisa.fr/tiki-index.php?page=Under-determined+speech+and+music+mixtures|previous SiSEC2008 page].

!! Evaluation criteria

We propose to use the same evaluation criteria as in SiSEC 2010, except that the order of the estimated sources must be recovered.

History

Legend: v=view, c=compare, d=diff

Date	User	Edit Comment	Version	Action
Thu 01 of Aug., 2013 02:05 CEST	admin		33 Current	v
Tue 30 of July, 2013 06:30 CEST	admin		32	v c d
Tue 30 of July, 2013 04:31 CEST	admin		31	v c d
Wed 06 of Mar., 2013 11:07 CET	admin	Correction of grammatical error, by Shigeki Miyabe	30	v c d
Wed 06 of Mar., 2013 11:03 CET	admin	Correction of link of the previous correction by Shigeki Miyabe	29	v c d
Wed 06 of Mar., 2013 10:58 CET	admin	Correction about years by Shigeki Miyabe	28	v c d
Wed 14 of Nov., 2012 23:13 CET	admin		27	v c d
Wed 14 of Nov., 2012 23:11 CET	admin		26	v c d
Wed 14 of Nov., 2012 04:22 CET	admin		25	v c d
Wed 14 of Nov., 2012 04:05 CET	admin		24	v c d
Mon 12 of Nov., 2012 23:20 CET	admin		23	v c d
Mon 12 of Nov., 2012 23:19 CET	admin		22	v c d

History: Underdetermined speech and music mixtures

Comparing version 22 with version 33

History

Sidebar

Menu

Sidebar

Google Search