• DocumentCode
    3739232
  • Title

    Data Mining Sound Archives: A New Scalable Algorithm for Parallel-Distributing Processing

  • Author

    Peter J; Dugan;Holger Klinck;John A. Zollweg;Christopher W. Clark

  • Author_Institution
    Bioacoustics Res. Program, Cornell Lab. of Ornithology Cornell Univ., Ithaca, NY, USA
  • fYear
    2015
  • Firstpage
    768
  • Lastpage
    772
  • Abstract
    This paper discusses a new algorithm, called the acoustic data-mining accelerator (ADA), which was developed to mine large sound archives for signals of interest including animal vocalizations. Background information on the development of ADA is provided, summarizing various projects that have utilized this technology since 2009. Performance was evaluated by comparing runtimes and efficiency metrics for two marine mammal detection algorithms that were applied to a 3-week single channel acoustic data set (sampled at 192 kHz and with 16 bit resolution). A total of four configurations (1, 8, 16 and 64 workers) demonstrated processing scalability. Results showed that each detection algorithm successfully processed the data set in all four configurations without changing the ADA algorithm. The fastest case (64 workers), had a total runtime performance of 1.5 hours; making the ADA 13 times more efficient than the serial case. Using a single worker it took more than 18 hours to process the same 3-week data set. Concurrent processing of both data-mining algorithms using 64 workers showed the highest efficiency gain (23x) compared to sequentially processing the data with a single worker.
  • Keywords
    "Algorithm design and analysis","Data mining","Runtime","Computational modeling","Scalability","Mathematical model","Concurrent computing"
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2015 IEEE International Conference on
  • Electronic_ISBN
    2375-9259
  • Type

    conf

  • DOI
    10.1109/ICDMW.2015.235
  • Filename
    7395746