Dear Dr. Jan Felcyn,
You may want to try the open-source robot audition software called "HARK".
HARK is developed as the audio equivalent of OpenCV.
It supports any kind of microphone array configuration and
provides sound source localization (2 algorithms), sound source separation
(12 algorithms), and an interface to automatic speech recognition (Julius and
Kaldi). HARK has been downloaded more than 120,000 times. Ubuntu and
Windows versions are available.
Some papers include:
Kazuhiro Nakadai, Toru Takahashi, Hiroshi G. Okuno, Hirofumi Nakajima, Yuji Hasegawa, Hiroshi Tsujino:
Design and Implementation of Robot Audition System "HARK".
Hiroshi G. Okuno, Kazuhiro Nakadai:
ROBOT AUDITION: ITS RISE AND PERSPECTIVES,
Proceedings of 2015 International Conference on
Acoustics, Speech and Signal Processing (ICASSP 2015),
pp.5610-5614, SS-L3.1,
Brisbane, Australia, April 19-24 (22), 2015.
doi:10.1109/ICASSP.2015.7179045
Kazuhiro Nakadai, Hiroshi G. Okuno, Takeshi Mizumoto:
Development, Deployment and Applications of Robot Audition Open Source Software HARK.
Journal of Robotics and Mechatronics,
Vol.29, No.1 (Feb. 2017), pp.16-25.
I organized a special issue on robot audition technologies in the
Journal of Robotics and Mechatronics, Vol.29, No.1 (Feb. 2017).
Some papers cover human-robot interaction, musical robots,
sound processing for a hose-shaped rescue robot, microphone
array processing for UAVs, bird song scene analysis, and
frog chorus analysis.
Enjoy,
- Gitchang -
Hiroshi "Gitchang" Okuno
Professor, Waseda University
Professor Emeritus, Kyoto University