Skip to content
View upskyy's full-sized avatar
🇰🇷
Yes we can !!
🇰🇷
Yes we can !!

Organizations

@openspeech-team

Block or report upskyy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
upskyy/README.md

Welcome, my friends 😄

Machine Learning Engineer in MUSINSA

Research interests

  • I am passionate about solving real-world problems with AI.
  • During my time at ReturnZero, I worked on various speech domain challenges, including speech recognition, speech synthesis, and speech foundation models.
  • At MUSINSA, I am currently focused on leveraging LLMs, Natural Language Processing, and Computer Vision to address practical problems.
  • I believe that every experience is a connecting dot that will help me tackle bigger and more meaningful problems in the future.

Pinned Loading

  1. openspeech-team/openspeech openspeech-team/openspeech Public

    Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

    Python 700 116

  2. rtzr/Awesome-Korean-Speech-Recognition rtzr/Awesome-Korean-Speech-Recognition Public

    한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

    410 22

  3. Squeezeformer Squeezeformer Public

    PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)

    Python 141 15

  4. Transformer-Transducer Transformer-Transducer Public

    PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)

    Python 104 19

  5. sgl-project/sglang sgl-project/sglang Public

    SGLang is a fast serving framework for large language models and vision language models.

    Python 14.8k 1.9k

  6. k2-fsa/sherpa k2-fsa/sherpa Public

    Speech-to-text server framework with next-gen Kaldi

    C++ 696 119