fft-embeddings

from the time domain to the frequency domain and back again, this time with text embeddings!

or, doing AI with librosa

read the full paper here

what this repo does:

split a large corpus of text into overlapping segments (see split_utils.split_text())
transpose the resulting time-domain embeddings (that is, a list of embeddings representing the sequential pieces of text) into the frequency domain with librosa's FFT implementation
do some signal processing in the frequency domain to, in principle, boost the signal-to-noise ratio of the embeddings (currently only lowpass filter is supported lmao)
transpose back to the time domain with librosa's ISTFT implementation where you can once again do all the classic embeddings-y things like semantic search, classification, etc.

results

sliced + fft'ed + lowpass'ed + istft'ed embeddings perform better than both openai's embeddings for the whole text and un-lowpass'ed sliced embeddings on the toy classification task in clf_demo.ipynb:

get embeddings for whole text: 97.1% accuracy
sliding window without lowpass filter: 96% accuracy
sliding window with lowpass filter @ 0.5: 97.6% accuracy

a cool visualization

spectrogram for channel 5 of openai's embeddings on the Gettysburg Address:

todo

explore frequency-domain "timbre"-based methods of performing nearest neighbors search
explore other spectral noise filtering implementations (adaptive filtering? is the noise static in this case?)
explore performance gains on other types of embeddings

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
texts		texts
.gitignore		.gitignore
README.md		README.md
clf_demo.ipynb		clf_demo.ipynb
corpus.py		corpus.py
exploring.ipynb		exploring.ipynb
gettysburg.py		gettysburg.py
spectro_5.png		spectro_5.png
split_utils.py		split_utils.py
too_many_lines.png		too_many_lines.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

texts

texts

.gitignore

.gitignore

README.md

README.md

clf_demo.ipynb

clf_demo.ipynb

corpus.py

corpus.py

exploring.ipynb

exploring.ipynb

gettysburg.py

gettysburg.py

spectro_5.png

spectro_5.png

split_utils.py

split_utils.py

too_many_lines.png

too_many_lines.png

Repository files navigation

fft-embeddings

what this repo does:

results

a cool visualization

todo

About

Releases

Packages

Languages

jagilley/fft-embeddings

Folders and files

Latest commit

History

Repository files navigation

fft-embeddings

what this repo does:

results

a cool visualization

todo

About

Resources

Stars

Watchers

Forks

Languages