Support for GPT2 and GPTJ? #35
I'm using ggjt file formats perfectly fine at the moment (the one that works great with mmap on recent llama.cpp versions, which I believe is linked into this).
@mikeggh To be clear, you are using GPTJ int4 GGML models fine with this repo at the moment? That is great to hear if I understood correctly!
This does not work for me. If I use the model conversion tool at the GGML repo seen here, or if I use the cformers repo seen here, I get an error about needing to update the format of the model. Trying to update the format of the model leads to issues. I think it may be worthwhile to try an older version of llama.cpp. I may try that.
@mallorbc, we are planning to support GPTJ as well, but for now, the code will work only with llama (I guess).
GPT-J support incoming in the next few days (alongside an assistant-style GPT-J model release)
@AndriyMulyar That is great to hear! Looking forward to GPTJ support! |
In the GGML repo there are guides for converting those models into GGML format, including int4 support. I have successfully done so myself and ran those models using the GPTJ binary in the examples.
In this repo here, there is support for GPTJ models with an API-like interface, but the downside is that each time you make an API call, the model has to be reloaded, adding two or so seconds to every call.
What I like about this repo is that it uses pybindings to load the model into memory and then allows API calls without reloading the model.
My question/request is for GPTJ support. I know it can be done with little to no code changes. If the code is abstracted enough, using GPTJ models after conversion may already work.
I think in the worst case, the example under GPTJ needs to be modified a bit. I will take a look at this if it does not currently work, and will contribute back if I get a working solution.
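For what it's worth, the performance difference being discussed can be sketched with a toy stand-in; the `ToyModel` class and its load delay below are hypothetical, just to illustrate why keeping the model resident in memory beats reloading it on every call:

```python
import time

class ToyModel:
    """Stand-in for a GGML model; constructing it simulates the slow weight load."""
    def __init__(self, path: str):
        time.sleep(0.05)  # pretend we are reading quantized weights from disk
        self.path = path

    def generate(self, prompt: str) -> str:
        return f"response to {prompt!r}"

def reload_per_call(prompts):
    # API-style: a fresh model is loaded for every request
    return [ToyModel("model.bin").generate(p) for p in prompts]

def load_once(prompts):
    # Binding-style: pay the load cost once, then serve all requests
    model = ToyModel("model.bin")
    return [model.generate(p) for p in prompts]

prompts = ["a", "b", "c"]

start = time.perf_counter()
reload_per_call(prompts)
reload_time = time.perf_counter() - start

start = time.perf_counter()
load_once(prompts)
once_time = time.perf_counter() - start

# Loading once amortizes the startup cost across all calls
print(reload_time > once_time)
```

Both functions return identical outputs; only the time spent reloading differs, which grows linearly with the number of calls in the reload-per-call style.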