20210322

-

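kaldi, Perceptual Linear Prediction (PLP) coefficients

"Illustrated Speech Recognition" (《图解语音识别》, book)
Python speech signal features: Perceptual Linear Prediction (PLP) coefficients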
https://blog.csdn.net/weixin_42485817/article/details/107590846

Linear Prediction (LPC)

LPC-based speech recognition

Linear Prediction Cepstrum Coefficients (LPCC)

also written "Linear Predictive Cepstrum Coefficients"
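
To make the LPC and LPCC terms above concrete, here is a minimal NumPy sketch, not taken from Kaldi or any linked project. It assumes the convention that x[n] is predicted as the sum of a_k * x[n - k], estimates the a_k with the autocorrelation method (Levinson-Durbin recursion), and then derives cepstral coefficients with the usual LPC-to-cepstrum recursion, leaving out the gain term c_0. The function names `lpc_autocorr` and `lpc_to_cepstrum` are made up for this note, and other toolkits may use the opposite sign for a_k.

```python
import numpy as np

def lpc_autocorr(x, order):
    """Estimate LPC coefficients a_1..a_p with the autocorrelation method.

    Convention assumed here: x[n] is predicted as sum_k a_k * x[n - k].
    """
    x = np.asarray(x, dtype=float)
    # Autocorrelation lags r[0..order]
    r = np.array([np.dot(x[:x.size - k], x[k:]) for k in range(order + 1)])
    a = np.zeros(order)   # a[k - 1] stores a_k
    err = r[0]            # prediction error energy of the order-0 model
    for i in range(order):
        # Levinson-Durbin step: reflection coefficient for order i + 1
        k = (r[i + 1] - np.dot(a[:i], r[i:0:-1])) / err
        prev = a[:i].copy()
        a[:i] = prev - k * prev[::-1]
        a[i] = k
        err *= 1.0 - k * k
    return a

def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to LPCC c_1..c_n (gain term c_0 omitted)."""
    a = np.asarray(a, dtype=float)
    p = a.size
    c = np.zeros(n_ceps)  # c[n - 1] stores c_n
    for n in range(1, n_ceps + 1):
        total = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            total += (k / n) * c[k - 1] * a[n - k - 1]
        c[n - 1] = total
    return c

# Toy signal: the impulse response of a one-pole filter with pole 0.9.
# For this signal the ideal predictor is a_1 = 0.9 and c_n = 0.9**n / n.
decay = 0.9
x = decay ** np.arange(200)
a = lpc_autocorr(x, order=2)
ceps = lpc_to_cepstrum(a, n_ceps=4)
print("LPC :", a)
print("LPCC:", ceps)
```

On real speech the same two functions would be applied per windowed frame; framing, pre-emphasis and windowing are deliberately left out of this sketch.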

LPCMCC

LPC mel cepstrum coefficients

DCT (Discrete Cosine Transform)

used in image compression
https://baike.baidu.com/item/离散余弦变换/7118270?fr=aladdin
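
A minimal NumPy sketch of why the DCT is useful for compression: for a smooth signal, almost all of the energy lands in the first few coefficients. This implements the orthonormal DCT-II directly from its definition; the helper name `dct2_ortho` is made up for this note and is not from any linked page.

```python
import numpy as np

def dct2_ortho(x):
    """Orthonormal DCT-II of a 1-D signal (hypothetical helper name)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    k = np.arange(n)[:, None]   # output (frequency) index, as a column
    m = np.arange(n)[None, :]   # input (sample) index, as a row
    basis = np.cos(np.pi * (2 * m + 1) * k / (2 * n))
    scale = np.full(n, np.sqrt(2.0 / n))
    scale[0] = np.sqrt(1.0 / n)  # the DC row uses a different scale factor
    return scale * (basis @ x)

# A smooth ramp: most of the energy ends up in the low-order coefficients.
signal = np.linspace(0.0, 1.0, 16)
coeffs = dct2_ortho(signal)
energy = coeffs ** 2
low_order_share = energy[:4].sum() / energy.sum()
print(f"energy share of first 4 of 16 coefficients: {low_order_share:.4f}")
```

Because the transform is orthonormal, the total energy of `coeffs` equals that of `signal`, so dropping the small high-order coefficients loses very little.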

(IMP???) pytorch crnn

https://github.com/isadrtdinov/kws-attention

TensorFlowLite Micro: Embedded Machine Learning on TinyML Systems

https://arxiv.org/pdf/2010.08678.pdf
https://github.com/raspberrypi/pico-tflmicro

CRNN network architecture explained

https://www.jianshu.com/p/4ac876a4cd5c

gpt-neo

https://github.com/EleutherAI/gpt-neo

As powerful as GPT-3 is, can 175 billion parameters still not handle Chinese?

https://www.huxiu.com/article/375604.html

Tatoeba-Challenge

https://github.com/Helsinki-NLP/Tatoeba-Challenge

PYTHON-AND-DATA-ANALYTICS-7-DAYS

https://github.com/ShapeAI/PYTHON-AND-DATA-ANALYTICS-7-DAYS

introduction-to-machine-learning

https://github.com/globalaihub/introduction-to-machine-learning

Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

https://github.com/iPERDance/iPERCore

(???) Deep Learning for NLP and Speech Recognition

???

(IMP) GitHub-Chinese-Top-Charts

search tensorflow or nn or pytorch
https://github.com/kon9chunkit/GitHub-Chinese-Top-Charts/blob/master/README-Part2.md#C

USound MEMS speaker development kit

https://www.cirmall.com/bbs/thread-204315-1-1.html?eefocus

(IMP) jetson nano, Waveshare (微雪), jetson-inference

https://www.waveshare.net/wiki/Jetson_Nano_Developer_Kit_Package_D
https://www.waveshare.net/study/article-892-1.html
https://github.com/dusty-nv/jetson-inference
https://www.waveshare.net/study/article-889-1.html
https://www.waveshare.net/study/article-893-1.html
https://www.pianshen.com/article/7547357666/
search baidupan, MNIST_TEST.zip
search baidupan, networks.zip
https://wiki.seeedstudio.com/cn/Jetson_Nano_OutBoxing_Demo/

jetson nano, TensorRT

https://developer.nvidia.com/zh-cn/tensorrt
https://github.com/NVIDIA/TensorRT
cuDNN, cuda
VPI
https://developer.nvidia.com/embedded/vpi
JetPack
https://developer.nvidia.com/zh-cn/embedded/jetpack
http://www.gpus.cn/gpus_list_page_techno_support_content?id=101

Academia | Paper collides with NVIDIA's, first author "cries in the bathroom"; NVIDIA: want to come intern with us?

https://www.sohu.com/a/274428157_129720

My friend, I know the feeling. I collided with Google a few weeks ago, and with DeepMind a few months before that. I work on AI; I don't drive bumper cars.

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

https://github.com/sebastianruder/NLP-progress

NVIDIA ASR

https://github.com/shuaaa/NVIDIA-DeepLearning
https://github.com/NVIDIA/DeepLearningExamples/tree/master/Kaldi/SpeechRecognition
https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechRecognition/Jasper

TensorFlow Lite Micro

https://github.com/search?q=TF_LITE_REPORT_ERROR+main_functions&type=code
https://github.com/mlubinsky/mlubinsky.github.com/blob/5991cad85b90b5f4d051014cee66b565fe22040b/sound/README.md
https://github.com/search?q=TF_LITE_REPORT_ERROR+main_functions+NOLINTNEXTLINE+spectrogram&type=Code
TfLiteConv3DParams
https://github.com/search?l=C&q=TfLiteConv3DParams&type=Code

Wio Terminal TFLM

https://github.com/Seeed-Studio/Seeed_Arduino_Sketchbook/blob/d59d97a35d696e18d51a329fc7aacb8d04c680a8/examples/WioTerminal_TinyML_4_Weather_Prediction/tensorflow_lite/library.properties
https://github.com/Seeed-Studio/Seeed_Arduino_Sketchbook/tree/master/examples/WioTerminal_TinyML_2_Audio_Scene_Recognition

tflite4zero_env

https://github.com/NewComer00/tflite4zero_env

TensorFlow_MIMXRT1064-EVK_Microspeech

https://github.com/ARMmbed/TensorFlow_MIMXRT1064-EVK_Microspeech

tensorflow-examples

https://github.com/antmicro/tensorflow-examples

voice-commands-using-arduino-and-ml

https://github.com/Apress/voice-commands-using-arduino-and-ml

stm32-tflm-micro-speech

https://github.com/tum-ei-eda/stm32-tflm-micro-speech

k210

https://github.com/fjpolo/eML

lib_audio_features, MFCC

https://github.com/xmos/lib_audio_features

ML-Sound-Classification

https://github.com/villasen/ML-Sound-Classification

(IMP???) same54 kws

https://microchipdeveloper.com/machine-learning:keywordspotting-with-edge-impulse
https://github.com/MicrochipTech/ml-same54-cult-wm8904-edgeimpulse-kws-demo

lyra

https://github.com/google/lyra

mlflow

https://github.com/mlflow/mlflow

numba

https://github.com/numba/numba

miniaudio

https://github.com/mackron/miniaudio

axon

https://github.com/elixir-nx/axon

(IMP) MAX9812

search baidupan, MAX9812
KY-038 microphone amplifier module

(IMP) ESP32-8-Octave-Audio-Spectrum-Display, fft

https://github.com/G6EJD/ESP32-8-Octave-Audio-Spectrum-Display
https://github.com/kosme/arduinoFFT

spleeter

https://github.com/deezer/spleeter

mozilla/TTS

https://github.com/mozilla/TTS

(IMP) ESP32 Audio Input - MAX4466, MAX9814, SPH0645LM4H, INMP441

https://blog.cmgresearch.com/2020/09/12/esp32-audio-input.html
ESP32 Audio Input - MAX4466, MAX9814, SPH0645LM4H, INMP441 (Chinese translation)
https://www.cnblogs.com/kerwincui/p/13751746.html
search baidupan, esp32-audio-input.docx

ESP32_MP3_Decoder

https://github.com/MrBuddyCasino/ESP32_MP3_Decoder

A C++ standalone library for machine learning

https://github.com/flashlight/flashlight

500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

https://github.com/ashishpatel26/500-AI-Machine-learning-Deep-learning-Computer-vision-NLP-Projects-with-code

start-machine-learning-in-2020

https://github.com/louisfb01/start-machine-learning-in-2020

IoT Innovation Project Development and Practice (物联网创新项目开发与实践)

search baidupan, 物联网创新项目开发与实践

Machine_Learning_2_months

https://github.com/Minhluu2911/Machine_Learning_2_months

onnx

https://github.com/onnx/onnx

(IMP???) SpeechBrain, A PyTorch Powered Speech Toolkit

https://github.com/speechbrain/speechbrain
https://speechbrain.github.io

tensorflow-pack

https://github.com/MDK-Packs/tensorflow-pack

yolov5-face

https://github.com/deepcam-cn/yolov5-face

tensorboard

https://github.com/tensorflow/tensorboard

The largest hub of ready-to-use NLP datasets for ML models with fast, easy-to-use and efficient data manipulation tools

https://github.com/huggingface/datasets

realsense-ros

https://github.com/IntelRealSense/realsense-ros

(IMP???) Continual_Learning_for_KWS

https://github.com/jianvora/Continual_Learning_for_KWS
https://gitee.com/weimingtom2000/Continual_Learning_for_KWS/tree/main/Keyword%20Spotting/TC-Resnet

MATLAB Computer Vision and Deep Learning in Practice

https://github.com/decouples/Matlab_deep_learning
Chapter 19: Simulated control of signal-light images based on speech recognition

pytorch-tutorial

https://github.com/yunjey/pytorch-tutorial

DouZero, AI for DouDizhu (斗地主)

https://github.com/kwai/DouZero

coqui-ai/TTS, a deep learning toolkit for Text-to-Speech, battle-tested in research and production

https://github.com/coqui-ai/TTS

sounddevice

https://github.com/spatialaudio/python-sounddevice
https://gist.github.com/akey7/94ff0b4a4caf70b98f0135c1cd79aff3

# Use the sounddevice module
# http://python-sounddevice.readthedocs.io/en/0.3.10/

import numpy as np
import sounddevice as sd
import time

# Samples per second
sps = 44100

# Frequency / pitch
freq_hz = 440.0

# Duration
duration_s = 5.0

# Attenuation so the sound is reasonable
atten = 0.3

# NumPy magic to calculate the waveform
each_sample_number = np.arange(duration_s * sps)
waveform = np.sin(2 * np.pi * each_sample_number * freq_hz / sps)
waveform_quiet = waveform * atten

# Play the waveform out the speakers
sd.play(waveform_quiet, sps)
time.sleep(duration_s)
sd.stop()

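# Play a WAV file with PyAudio
import pyaudio
from scipy.io import wavfile

# sr is the sample rate in Hz; wdata is a NumPy array of samples
sr, wdata = wavfile.read('house_lo.wav')
channels = 1 if wdata.ndim == 1 else wdata.shape[1]

p = pyaudio.PyAudio()
# Sample width in bytes comes from the dtype (1 for 8-bit, 2 for 16-bit PCM)
stream = p.open(format=p.get_format_from_width(wdata.dtype.itemsize),
                channels=channels, rate=sr, output=True)
# PyAudio expects raw bytes, not a NumPy array
stream.write(wdata.tobytes())
stream.stop_stream()
stream.close()
p.terminate()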

darts, A python library for easy manipulation and forecasting of time series.

https://github.com/unit8co/darts

Listen, Attend and Spell model and a pretrained Chinese Mandarin ASR model

https://github.com/jackaduma/LAS_Mandarin_PyTorch
