
feat: add groq api and model options #108

Closed
gluneau wants to merge 3 commits

Conversation


@gluneau commented Apr 6, 2024

What does this implement

  • Add Groq API and model options

Get a free API key here: https://console.groq.com/keys

Use as usual by naming the Groq model in the command:

python run.py --model_name groq-mixtral8x7b \
  --data_path https://github.com/pvlib/pvlib-python/issues/1603 \
  --config_file config/default.yaml
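
The API key itself is presumably read the same way the other providers' keys are, via an entry in keys.cfg; the variable name below follows the usual <PROVIDER>_API_KEY pattern and is an assumption, not something this PR confirms:

# keys.cfg (assumed key name)
GROQ_API_KEY: 'your-groq-api-key'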

A review comment was left on this snippet from sweagent/agent/models.py:

@retry(
    stop=stop_after_attempt(3),
    retry=retry_if_not_exception_type((CostLimitExceededError, RuntimeError)),
)
def query(self, history: list[dict[str, str]]) -> str:

Member

How similar are this and history_to_messages to the OpenAIModel class? Because if that's exactly the same, let's just subclass from there to cut down on duplication.

Author

It is the same, since I'm using the OpenAI() client; Groq is designed to be compatible with OpenAI:
https://console.groq.com/docs/openai
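
For illustration, a minimal sketch of what that subclassing could look like, assuming OpenAIModel takes (args, commands) in its constructor and stores its client on self.client (the constructor signature and attribute name are assumptions; the base URL is from Groq's OpenAI-compatibility docs):

import os

from openai import OpenAI

class GroqModel(OpenAIModel):
    def __init__(self, args, commands):
        # Inherit query() and history_to_messages() unchanged from
        # OpenAIModel; only the client endpoint and key differ.
        super().__init__(args, commands)
        self.client = OpenAI(
            api_key=os.environ["GROQ_API_KEY"],  # assumed env var name
            base_url="https://api.groq.com/openai/v1",  # Groq's OpenAI-compatible endpoint
        )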


klieret commented Apr 8, 2024

Note: #108 and #134 seem to have the same goal.


crjaensch commented Apr 10, 2024

> Note: #108 and #134 seem to have the same goal.

I agree. I have no particular preference whether you approve PR #108 or PR #134. In fact, PR #108 seems to have a better separation of concerns than the minimal surgical changes proposed by my PR.
However, I would recommend that PR #108 follow the model naming convention that you introduced, for example, for the Azure and Ollama models.
That means a Groq model like Mixtral 8x7B should be prefixed as groq:<model_name> rather than groq-<model_name> as proposed in PR #108.
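
For illustration, the dispatch for such a prefix might look like the following sketch (hypothetical names; the repo's actual model-factory code is not shown in this thread):

def get_model(model_name: str, *args, **kwargs):
    # Hypothetical dispatcher: route "groq:<model>" the same way the
    # "azure:" and "ollama:" prefixes are presumably handled.
    if model_name.startswith("groq:"):
        return GroqModel(model_name.removeprefix("groq:"), *args, **kwargs)
    # ... dispatch for other providers ...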


codecov bot commented Apr 11, 2024

Codecov Report

Attention: Patch coverage is 29.16667% with 17 lines in your changes missing coverage. Please review.

Project coverage is 65.09%. Comparing base (b084c9f) to head (a0c0347).
Report is 873 commits behind head on main.

Files with missing lines    Patch %    Lines
sweagent/agent/models.py    29.16%     17 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #108      +/-   ##
==========================================
- Coverage   65.51%   65.09%   -0.43%     
==========================================
  Files          16       16              
  Lines        2117     2140      +23     
==========================================
+ Hits         1387     1393       +6     
- Misses        730      747      +17     


@klieret modified the milestone: 0.2.1 (Apr 15, 2024)

EwoutH commented Apr 22, 2024

I think Groq is currently one of the most interesting options for API-based agents. It's not only the fastest (by far), but also the cheapest at running many models, including Llama 3.

Would love to see it supported in SWE-agent!


@EwoutH mentioned this pull request Apr 22, 2024

asapsav commented May 12, 2024

Hi! Just a couple of additions:

  1. In class GroqModel(BaseModel), llama2 should be changed to "llama3-70b-8192".
  2. After a couple of agent iterations (using Groq), an "Exit due to cost limit" error appears, which does not make sense because Groq costs $0 (see the sketch after this list).
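
On point 2, the cost limit presumably trips because the Groq models were registered with nonzero per-token prices. Below is a sketch of the kind of entry that would avoid it, with field names assumed from how other models appear to be registered in models.py (not verified against this PR):

# Assumed shape of the per-model cost table in sweagent/agent/models.py.
MODEL_COST = {
    "llama3-70b-8192": {
        "max_context": 8_192,
        "cost_per_input_token": 0.0,   # free tier: never trips the cost limit
        "cost_per_output_token": 0.0,
    },
}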


klieret commented May 14, 2024

Apologies for letting this PR hang for such a long time. Part of the reason is that models.py is just growing and growing, and we should instead just support litellm (potentially leaving Claude and GPT as before to make sure the paper results are still reproducible, unless we check that it's 100% a drop-in replacement). The other reason is that so far only the largest models (GPT, Claude) were good enough to make sense for SWE-agent, which pushed support for other models down the list of priorities.

But let me come back with a concrete plan later this week!


klieret commented Oct 9, 2024

Closing because we should use litellm to support more models (#64).

@klieret closed this Oct 9, 2024