《图解语音识别》
Python语音信号特征-感知线性预测系数PLP
https://blog.csdn.net/weixin_42485817/article/details/107590846
基于LPC的语音识别
Linear Predictive Cepstrum Coefficients,LPCC
线性预测倒谱系数
LPC美尔倒频谱系数
图像压缩方面
https://baike.baidu.com/item/离散余弦变换/7118270?fr=aladdin
https://github.com/isadrtdinov/kws-attention
https://arxiv.org/pdf/2010.08678.pdf
https://github.com/raspberrypi/pico-tflmicro
https://www.jianshu.com/p/4ac876a4cd5c
https://github.com/EleutherAI/gpt-neo
https://www.huxiu.com/article/375604.html
https://github.com/Helsinki-NLP/Tatoeba-Challenge
https://github.com/ShapeAI/PYTHON-AND-DATA-ANALYTICS-7-DAYS
https://github.com/globalaihub/introduction-to-machine-learning
https://github.com/iPERDance/iPERCore
???
search tensorflow or nn or pytorch
https://github.com/kon9chunkit/GitHub-Chinese-Top-Charts/blob/master/README-Part2.md#C
https://www.cirmall.com/bbs/thread-204315-1-1.html?eefocus
https://www.waveshare.net/wiki/Jetson_Nano_Developer_Kit_Package_D
https://www.waveshare.net/study/article-892-1.html
https://github.com/dusty-nv/jetson-inference
https://www.waveshare.net/study/article-889-1.html
https://www.waveshare.net/study/article-893-1.html
https://www.pianshen.com/article/7547357666/
search baidupan, MNIST_TEST.zip
search baidupan, networks.zip
https://wiki.seeedstudio.com/cn/Jetson_Nano_OutBoxing_Demo/
https://developer.nvidia.com/zh-cn/tensorrt
https://github.com/NVIDIA/TensorRT
cuDNN, cuda
VPI
https://developer.nvidia.com/embedded/vpi
JetPack
https://developer.nvidia.com/zh-cn/embedded/jetpack
http://www.gpus.cn/gpus_list_page_techno_support_content?id=101
https://www.sohu.com/a/274428157_129720
我的朋友,深有同感。我几周前和谷歌撞车,几个月前还和 DeepMind 撞车。我是搞人工智能的,又不是开碰碰车的。
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
https://github.com/sebastianruder/NLP-progress
https://github.com/shuaaa/NVIDIA-DeepLearning
https://github.com/NVIDIA/DeepLearningExamples/tree/master/Kaldi/SpeechRecognition
https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechRecognition/Jasper
https://github.com/search?q=TF_LITE_REPORT_ERROR+main_functions&type=code
https://github.com/mlubinsky/mlubinsky.github.com/blob/5991cad85b90b5f4d051014cee66b565fe22040b/sound/README.md
https://github.com/search?q=TF_LITE_REPORT_ERROR+main_functions+NOLINTNEXTLINE+spectrogram&type=Code
TfLiteConv3DParams
https://github.com/search?l=C&q=TfLiteConv3DParams&type=Code
https://github.com/Seeed-Studio/Seeed_Arduino_Sketchbook/blob/d59d97a35d696e18d51a329fc7aacb8d04c680a8/examples/WioTerminal_TinyML_4_Weather_Prediction/tensorflow_lite/library.properties
https://github.com/Seeed-Studio/Seeed_Arduino_Sketchbook/tree/master/examples/WioTerminal_TinyML_2_Audio_Scene_Recognition
https://github.com/NewComer00/tflite4zero_env
https://github.com/ARMmbed/TensorFlow_MIMXRT1064-EVK_Microspeech
https://github.com/antmicro/tensorflow-examples
https://github.com/Apress/voice-commands-using-arduino-and-ml
https://github.com/tum-ei-eda/stm32-tflm-micro-speech
https://github.com/xmos/lib_audio_features
https://github.com/villasen/ML-Sound-Classification
https://microchipdeveloper.com/machine-learning:keywordspotting-with-edge-impulse
https://github.com/MicrochipTech/ml-same54-cult-wm8904-edgeimpulse-kws-demo
https://github.com/google/lyra
https://github.com/mlflow/mlflow
https://github.com/numba/numba
https://github.com/mackron/miniaudio
https://github.com/elixir-nx/axon
search baidupan, MAX9812
KY-038 麦克风放大器模块
https://github.com/G6EJD/ESP32-8-Octave-Audio-Spectrum-Display
https://github.com/kosme/arduinoFFT
https://github.com/deezer/spleeter
https://github.com/mozilla/TTS
https://blog.cmgresearch.com/2020/09/12/esp32-audio-input.html
ESP32音频输入-MAX4466,MAX9814,SPH0645LM4H,INMP441(翻译)
https://www.cnblogs.com/kerwincui/p/13751746.html
search baidupan, esp32-audio-input.docx
https://github.com/MrBuddyCasino/ESP32_MP3_Decoder
https://github.com/flashlight/flashlight
https://github.com/louisfb01/start-machine-learning-in-2020
search baidupan, 物联网创新项目开发与实践
https://github.com/Minhluu2911/Machine_Learning_2_months
https://github.com/speechbrain/speechbrain
https://speechbrain.github.io
https://github.com/MDK-Packs/tensorflow-pack
https://github.com/deepcam-cn/yolov5-face
https://github.com/mlflow/mlflow
https://github.com/tensorflow/tensorboard
The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools
https://github.com/huggingface/datasets
https://github.com/flashlight/flashlight
https://github.com/IntelRealSense/realsense-ros
https://github.com/jianvora/Continual_Learning_for_KWS
https://gitee.com/weimingtom2000/Continual_Learning_for_KWS/tree/main/Keyword%20Spotting/TC-Resnet
https://github.com/decouples/Matlab_deep_learning
第 19 章 基于语音识别的信号灯图像模拟控制技术
https://github.com/yunjey/pytorch-tutorial
https://github.com/kwai/DouZero
https://github.com/coqui-ai/TTS
https://github.com/spatialaudio/python-sounddevice
https://gist.github.com/akey7/94ff0b4a4caf70b98f0135c1cd79aff3
# Use the sounddevice module
# http://python-sounddevice.readthedocs.io/en/0.3.10/
import numpy as np
import sounddevice as sd
import time
# Samples per second
sps = 44100
# Frequency / pitch
freq_hz = 440.0
# Duration
duration_s = 5.0
# Attenuation so the sound is reasonable
atten = 0.3
# NumpPy magic to calculate the waveform
each_sample_number = np.arange(duration_s * sps)
waveform = np.sin(2 * np.pi * each_sample_number * freq_hz / sps)
waveform_quiet = waveform * atten
# Play the waveform out the speakers
sd.play(waveform_quiet, sps)
time.sleep(duration_s)
sd.stop()
import pyaudio
from scipy.io import wavfile
sr, wdata=wavfile.read('house_lo.wav')
p = pyaudio.PyAudio()
stream = p.open(format = p.get_format_from_width(1), channels = 1, rate = sr, output = True)
stream.write(wdata)
stream.stop_stream()
stream.close()
p.terminate()