Open-source data-centric IDE for NLP

Hello everyone, we are an open-source company that tries to evolve data labeling to its next level. In short, we are the open-source data-centric IDE for NLP. Combining (semi-)automated labeling, extensive data management, and neural search capabilities.

We have just launched our tool “refinery” on GitHub and are now looking to spread the word about what we are doing, to the data science and open-source community.

If you would like to take a look at our repo (and maybe even give it a star), you will find it here: GitHub - code-kern-ai/refinery: The open-source data-centric IDE for NLP. Combining programmatic labeling, extensive data management and neural search capabilities..

Or, if you just want to keep up to date with our progress, you can join our Discord (just message me for an invite) or subscribe to our newsletter on our website,

For a detailed explanation of how the refinery works, you can check out this blog post we recently published:

1 Like