Skip to content
View SamuelCahyawijaya's full-sized avatar

Highlights

  • Pro

Organizations

@audioku @IndoNLP

Block or report SamuelCahyawijaya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. IndoNLP/indonlu IndoNLP/indonlu Public

    The first-ever vast natural language processing benchmark for Indonesian Language. We provide multiple downstream tasks, pre-trained IndoBERT models, and a starter code! (AACL-IJCNLP 2020)

    Jupyter Notebook 631 207

  2. IndoNLP/nusa-crowd IndoNLP/nusa-crowd Public

    A collaborative project to collect datasets in Indonesian languages.

    Jupyter Notebook 275 63

  3. IndoNLP/nusax IndoNLP/nusax Public

    High-quality parallel resource on sentiment analysis for 10 low-resource Indonesian languages, English, and Indonesian (Outstanding Paper at EACL 2023)

    Jupyter Notebook 107 10

  4. IndoNLP/indonlg IndoNLP/indonlg Public

    The first-ever vast natural language generation benchmark for Indonesian, Sundanese, and Javanese. We provide multiple downstream tasks, pre-trained IndoGPT and IndoBART models, and a starter code!…

    Python 76 14

  5. IndoNLP/nusa-writes IndoNLP/nusa-writes Public

    NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented and extremely low-resource Indonesian local languages.

    Jupyter Notebook 27 1

  6. SEACrowd/seacrowd-datahub SEACrowd/seacrowd-datahub Public

    A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.

    Python 93 56