Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications

Shajin Prince; Bini D; Alfred Kirubaraj A; Samson Immanuel J; Surya M

Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications

Authors

Shajin Prince Karunya Institute of Technology and Sciences
Bini D Karunya Institute of Technology and Sciences
Alfred Kirubaraj A Karunya Institute of Technology and Sciences
Samson Immanuel J Karunya Institute of Technology and Sciences
Surya M Roever Engineering College

Abstract

Audio data compression is used to reduce the transmission bandwidth and storage requirements of audio data. It is the second stage in the audio mastering process with audio equalization being the first stage. Compression algorithms such as BSAC, MP3 and AAC are used as standards in this paper. The challenge faced in audio compression is compressing the signal at low bit rates. The previous algorithms which work well at low bit rates cannot be dominant at higher bit rates and vice-versa. This paper proposes an altered form of vector quantization algorithm which produces a scalable bit stream which has a number of fine layers of audio fidelity. This modified form of the vector quantization algorithm is used to generate a perceptually audio coder which is scalable and uses the quantization and encoding stages which are responsible for the psychoacoustic and arithmetical terminations that are actually detached as practically all the data detached during the prediction phases at the encoder side is supplemented towards the audio signal at decoder stage. Therefore, clearly the quantization phase which is modified to produce a bit stream which is scalable. This modified algorithm works well at both lower and higher bit rates. Subjective evaluations were done by audio professionals using the MUSHRA test and the mean normalized scores at various bit rates was noted and compared with the previous algorithms.

Author Biographies

Shajin Prince, Karunya Institute of Technology and Sciences

Assistant Professor

Department of Electronics and Communication Engineering

Bini D, Karunya Institute of Technology and Sciences

Assistant Professor

Department of Electronics and Communication Engineering

Alfred Kirubaraj A, Karunya Institute of Technology and Sciences

Assistant Professor

Department of Electronics and Communication Engineering

Samson Immanuel J, Karunya Institute of Technology and Sciences

Assistant Professor

Department of Electronics and Communication Engineering

References

Sinha D and C. Sundberg, “Unequal error protection (UEP) for perceptual audio coders,” IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), 1999, pp. 2423–2326.

Mondal, U.K, “Achieving lossless compression of audio by encoding its constituted components (LCAEC),” Innovations Syst Softw Eng Vol 15, 2019, pp.75–85.

Huang, H. Shu, and R. Yu, “Lossless Audio Compression in The New IEEE Standard for Advanced Audio Coding,” IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP), 2014, pp. 6934 – 6938.

M. Sandler and D. Black, “Scalable audio coding for compression and loss resilient streaming,” IEEE Proceeding. -Visual. Image Signal Processing., Vol. 153, No. 3, 2006, pp. 331–339.

Srivatsan Kandadai & Charles D. Creusere, “Scalable Audio Compression at Low Bitrates,” IEEE Transactions on Audio, Speech, and Language Processing. Vol.16, No.5, 2008, pp. 969- 979.

Pramila Srinivasan and Leah H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Transactions on Signal Processing, Vol. 46, No.4, 1998, pp.1085 – 1093.

Manas Arora,Neha Maurya, “Audio Compression in MPEG Technology,” International Journal of Scientific and Research Publications. Vol.3, No.12, 2013, pp.1-4.

D. Pan, “A tutorial on MPEG/audio compression,” IEEE Multimedia. Vol. 2, No.2, 1995, pp.60-74.

Moreno-Alvarado R.G, Mauricio Martinez-Garcia, Mariko Nakano and Héctor M. Pérez, “DCT-Compressive Sampling of Multifrequency sparse audio signals,” IEEE Latin-America Conference on Communications, 2014.

Subbarao V. Wunnava, and Craig Chin, “Multilevel Data Compression Techniques for Transmission of Audio over Networks. Proceedings,” IEEE South east Conference, 2001, pp.234 – 238.

Florin Ghido, “An Asymptotically Optimal Predictor for Stereo Lossless Audio Compression,” Proceedings of the Data Compression Conference, 2003.

Rongshan Yu and Chi Chung Ko, “Lossless Compression of Digital Audio Using Cascaded RLS-LMS Prediction.” IEEE Transactions on Audio, Speech, and Language Processing, Vol.11, No.6, 2003, pp.532 – 537.

Teddy Surya Gunawan, M. Khalif Mat Zain, Fathiah Abdul Muin and Mira Kartiwi, “Investigation of Lossless Audio Compression using IEEE 1857.2 Advanced Audio Coding,” Indonesian Journal of Electrical Engineering and Computer Science Vol.6, No.2, 2017, pp.422 – 430.

Anthony Griffin, Toni Hirvonen, Christos Tzagkarakis, Athanasios Mouchtaris and Panagiotis Tsakalides, “Single-Channel and Multi-Channel Sinusoidal Audio Coding Using Compressed Sensing,” IEEE Transactions on Audio, Speech, and Language Processing. Vol.19, No.5, 2010, pp.1382 – 1395.

Rubem J. V. de Medeiros, Edmar C. Gurj˜ao and Joˆao M. de Carvalho, “Lossy Audio Compression Via Compressed Sensing. Proceedings of the Data Compression Conference,2010.

Duarte, M. Davenport, D. Takhar, J. Laska, T. Sun, K. Kelly, and R. Baraniuk, “Single- pixel imaging via compressive sampling,” IEEE Signal Processing Magazine. Vol.25, No.2, 2008, pp.83– 91.

Larsen M.H, M. G. Christensen, and S. H. Jensen, “Variable dimension trellis-coded quantization of sinusoidal parameters,” IEEE Signal Processing Letters. Vol.15, 2008, pp.17–20.

Vafin R and W. B. Kleijn, “Entropy-constrained polar quantization and its application to audio coding,” IEEE Transactions on Audio, Speech, and Language Processing, Vol.13, No. 2, 2005, pp.220–232.

Cecchi, S.; Virgulti, M.; Primavera, A.; Piazza, F.; Bettarelli, F.; Li, J, “Investigation on audio algorithms architecture for stereo portable devices,” Journal of Audio Engineering Society, Vol.64, 2016, pp.175–188.

Creusere C, “Understanding perceptual distortion in MPEG scalable audio coding. IEEE Transactions on Audio, Speech, and Language Processing, Vol.13, No.3, 2005, pp. 422–431.

Downloads

Published

2024-04-19

Issue

Vol. 69 No. 2 (2023)

Section

Digital Signal Processing

License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

1. License

The non-commercial use of the article will be governed by the Creative Commons Attribution license as currently displayed on https://creativecommons.org/licenses/by/4.0/.

2. Author’s Warranties

The author warrants that the article is original, written by stated author/s, has not been published before, contains no unlawful statements, does not infringe the rights of others, is subject to copyright that is vested exclusively in the author and free of any third party rights, and that any necessary written permissions to quote from other sources have been obtained by the author/s. The undersigned also warrants that the manuscript (or its essential substance) has not been published other than as an abstract or doctorate thesis and has not been submitted for consideration elsewhere, for print, electronic or digital publication.

3. User Rights

Under the Creative Commons Attribution license, the author(s) and users are free to share (copy, distribute and transmit the contribution) under the following conditions: 1. they must attribute the contribution in the manner specified by the author or licensor, 2. they may alter, transform, or build upon this work, 3. they may use this contribution for commercial purposes.

4. Rights of Authors

Authors retain the following rights:

- copyright, and other proprietary rights relating to the article, such as patent rights,

- the right to use the substance of the article in own future works, including lectures and books,

- the right to reproduce the article for own purposes, provided the copies are not offered for sale,

- the right to self-archive the article

- the right to supervision over the integrity of the content of the work and its fair use.

5. Co-Authorship

If the article was prepared jointly with other authors, the signatory of this form warrants that he/she has been authorized by all co-authors to sign this agreement on their behalf, and agrees to inform his/her co-authors of the terms of this agreement.

6. Termination

This agreement can be terminated by the author or the Journal Owner upon two months’ notice where the other party has materially breached this agreement and failed to remedy such breach within a month of being given the terminating party’s notice requesting such breach to be remedied. No breach or violation of this agreement will cause this agreement or any license granted in it to terminate automatically or affect the definition of the Journal Owner. The author and the Journal Owner may agree to terminate this agreement at any time. This agreement or any license granted in it cannot be terminated otherwise than in accordance with this section 6. This License shall remain in effect throughout the term of copyright in the Work and may not be revoked without the express written consent of both parties.

7. Royalties

This agreement entitles the author to no royalties or other fees. To such extent as legally permissible, the author waives his or her right to collect royalties relative to the article in respect of any use of the article by the Journal Owner or its sublicensee.

8. Miscellaneous

The Journal Owner will publish the article (or have it published) in the Journal if the article’s editorial process is successfully completed and the Journal Owner or its sublicensee has become obligated to have the article published. Where such obligation depends on the payment of a fee, it shall not be deemed to exist until such time as that fee is paid. The Journal Owner may conform the article to a style of punctuation, spelling, capitalization and usage that it deems appropriate. The Journal Owner will be allowed to sublicense the rights that are licensed to it under this agreement. This agreement will be governed by the laws of Poland.

By signing this License, Author(s) warrant(s) that they have the full power to enter into this agreement. This License shall remain in effect throughout the term of copyright in the Work and may not be revoked without the express written consent of both parties.

Audio Compression using a Modified Vector Quantization algorithm for Mastering Applications

Authors

Abstract

Author Biographies

Shajin Prince, Karunya Institute of Technology and Sciences

Bini D, Karunya Institute of Technology and Sciences

Alfred Kirubaraj A, Karunya Institute of Technology and Sciences

Samson Immanuel J, Karunya Institute of Technology and Sciences

References

Downloads

Published

Issue

Section

License

Information

Current Issue