GD_Spectrogram

NOTE: This project is under heavy development, and this description may not reflect recent (potentially large) changes.

This is a demo for capturing spectrograms, mel-scale spectrograms, and mel-frequency cepstral coefficients (MFCCs), and for identifying formants, using GDScript.

The demo includes a scene + script for capturing audio over time and generating images on close.

The demo also includes a scene + script for showing a spectrogram in realtime over a short time window.

Both demos identify the first 4 formants in the analyzed audio. The formants are drawn in green on the spectrogram image. The realtime demo uses a faster/less-accurate dynamic compression method for this purpose.

Both demos give a realtime visualization of the bucket levels using progress bars, and labels for the formant frequencies.
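As a rough illustration of what "mel-scale" buckets mean, here is a minimal sketch in Python (used only for illustration; the project itself is GDScript). It assumes the common HTK-style mel formula, which may not be the variant this project uses, and the function names and the 0 to 8000 Hz range are hypothetical.

```python
from __future__ import annotations

import math


def hz_to_mel(hz: float) -> float:
    """Convert a frequency in Hz to mels (HTK-style formula, an assumption)."""
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_bucket_edges(num_buckets: int, min_hz: float, max_hz: float) -> list[float]:
    """Return num_buckets + 1 edge frequencies (Hz), evenly spaced in mels."""
    lo = hz_to_mel(min_hz)
    hi = hz_to_mel(max_hz)
    step = (hi - lo) / num_buckets
    return [mel_to_hz(lo + i * step) for i in range(num_buckets + 1)]


if __name__ == "__main__":
    # Hypothetical settings: 8 buckets covering 0 Hz to 8 kHz.
    for low, high in zip(mel_bucket_edges(8, 0.0, 8000.0)[:-1],
                         mel_bucket_edges(8, 0.0, 8000.0)[1:]):
        print(f"{low:8.1f} Hz .. {high:8.1f} Hz")
```

Buckets spaced this way are narrow at low frequencies and wide at high frequencies, which is why a mel-scale spectrogram shows more detail in the range where speech formants sit.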

Example images (not reproduced here): a spectrogram, a mel-scale spectrogram, and an MFCC image.

NOTE

The non-realtime demo generates the capture on exit_tree, so you have to close the app with the X on the window for it to work. Pressing the stop-debugging button doesn't trigger the signal. You can bind this to anything; I was just lazy for prototyping. Capture starts as soon as you press play.

You may need to adjust the FFT size or NUM_BUCKETS to suit your needs and/or hardware capabilities.
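To get a feel for the FFT size trade-off, here is a small sketch in Python (illustration only, not project code). It uses the standard relations between FFT size, frequency resolution, and window length. The 44100 Hz sample rate is an assumption; check your project's audio mix rate. The function name is hypothetical.

```python
def fft_tradeoff(fft_size: int, sample_rate: float = 44100.0) -> dict:
    """Summarize what a given FFT size costs and buys.

    sample_rate defaults to 44100 Hz, which is an assumption, not a
    value taken from this project.
    """
    return {
        # Spacing between adjacent FFT bins: larger FFT = finer frequency detail.
        "bin_width_hz": sample_rate / fft_size,
        # Audio covered by one FFT frame: larger FFT = coarser time detail.
        "window_ms": 1000.0 * fft_size / sample_rate,
        # Number of non-redundant bins for a real-valued input signal.
        "num_bins": fft_size // 2 + 1,
    }


if __name__ == "__main__":
    for size in (256, 512, 1024, 2048, 4096):
        info = fft_tradeoff(size)
        print(f"{size:5d}: {info['bin_width_hz']:7.2f} Hz/bin, "
              f"{info['window_ms']:6.2f} ms window")
```

A larger FFT size separates nearby frequencies better (helpful for telling formants apart) but needs more samples per frame and more work per update, so the realtime demo may need a smaller value on slower hardware.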

The .gitignore is set to ignore the .tres files in the captures folder because they can be too large for GitHub, depending on the length of the capture.

CREDITS

This software contains assets from the Librosa repo (sample sounds for validation). See LICENSE.LIBROSA.md for information on permissions.

This software is released under the MIT License; see LICENSE for more information.

Created By: Ryan Powell, 2024.
