Skip to content
Change the repository type filter

All

    Repositories list

    • WangchanX

      Public
      WangchanX Fine-tuning Pipeline
      Jupyter Notebook
      Apache License 2.0
      64210Updated Oct 4, 2024Oct 4, 2024
    • crfcut

      Public
      Thai sentence segmentation with conditional random fields
      Jupyter Notebook
      31503Updated Jun 17, 2024Jun 17, 2024
    • Python
      0300Updated Jun 11, 2024Jun 11, 2024
    • HTML
      0000Updated May 29, 2024May 29, 2024
    • 0100Updated May 29, 2024May 29, 2024
    • Python
      0500Updated May 29, 2024May 29, 2024
    • WangchanX Eval
      Python
      Apache License 2.0
      1900Updated May 29, 2024May 29, 2024
    • thai2nmt

      Public
      English-Thai Machine Translation Models
      Python
      Apache License 2.0
      62821Updated May 3, 2024May 3, 2024
    • Thai-NNER

      Public
      Pytorch implementation of paper: Thai Nested Named Entity Recognition
      Python
      MIT License
      73920Updated Jan 8, 2024Jan 8, 2024
    • Pretraining transformer based Thai language models
      Jupyter Notebook
      Apache License 2.0
      2211693Updated Nov 6, 2023Nov 6, 2023
    • Python
      Apache License 2.0
      1302Updated Nov 21, 2022Nov 21, 2022
    • colab

      Public
      Collections of Google Colab notebooks and some data.
      Jupyter Notebook
      8701Updated Sep 30, 2022Sep 30, 2022
    • Kaldi recipe to train commonvoice corpus in Thai language
      Shell
      83631Updated Aug 12, 2022Aug 12, 2022
    • mmocr

      Public
      OpenMMLab Text Detection, Recognition and Understanding Toolbox
      Python
      Apache License 2.0
      747000Updated Jul 25, 2022Jul 25, 2022
    • EasyOCR

      Public
      Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
      Python
      Apache License 2.0
      3.1k200Updated Jul 21, 2022Jul 21, 2022
    • SynthMIDI

      Public
      A single-note classification dataset generated from MIDI file.
      Python
      GNU General Public License v3.0
      0020Updated Jun 27, 2022Jun 27, 2022
    • Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0
      Jupyter Notebook
      Creative Commons Attribution Share Alike 4.0 International
      124531Updated Apr 23, 2022Apr 23, 2022
    • WSSET

      Public
      TF2 implementation of paper: Self-supervised Deep Metric Learning for Pointsets, ICDE 2021
      Python
      0700Updated Mar 7, 2022Mar 7, 2022
    • Speech Emotion Recognition using PyTorch sponsored by AIS and VISTEC-DEPA AIResearch Institute Thailand.
      Python
      Apache License 2.0
      51720Updated Nov 6, 2021Nov 6, 2021
    • Python
      0400Updated Sep 29, 2021Sep 29, 2021
    • Extract en-th parallel sentences from PDFs
      Python
      Other
      1200Updated Aug 20, 2021Aug 20, 2021
    • Lesson 0 - Orientation
      Jupyter Notebook
      MIT License
      7201Updated Jul 6, 2021Jul 6, 2021
    • 153610Updated Mar 26, 2021Mar 26, 2021
    • Generated product reviews dataset for machine translation quality estimation, part of [scb-mt-en-th-2020](https://arxiv.org/pdf/2007.03541.pdf)
      Jupyter Notebook
      Other
      0000Updated Dec 4, 2020Dec 4, 2020
    • ai2api

      Public
      Productionize NLP models trained on Pytorch by AIResearch.in.th
      Jupyter Notebook
      Apache License 2.0
      1100Updated Oct 7, 2020Oct 7, 2020
    • sme-depa

      Public
      Help small businesses make money from their transaction data; workshop at depa
      Jupyter Notebook
      Other
      4700Updated Aug 12, 2020Aug 12, 2020
    • nlp

      Public
      🤗nlp – Datasets and evaluation metrics for Natural Language Processing in NumPy, Pandas, PyTorch and TensorFlow
      Python
      Apache License 2.0
      2.7k000Updated Jul 14, 2020Jul 14, 2020
    • 31400Updated Jun 22, 2020Jun 22, 2020
    • 2100Updated Jun 3, 2020Jun 3, 2020
    • Scripts for crawling the 500 most visited websites in Thailand according to Alexa for `th` and `en` parallel texts.
      Python
      Apache License 2.0
      2500Updated May 6, 2020May 6, 2020