Support Usage Tokens Output in Claude API #667

Open
exa256 opened this issue May 14, 2024 · 3 comments

Comments


exa256 commented May 14, 2024

Is your feature request related to a problem? Please describe.
Currently, reporting the usage dictionary from the OpenAI API is supported, as shown in the docs: https://python.useinstructor.com/concepts/usage/?h=token+usage

However, the Claude API patch does not have this functionality, even though usage is available in a successful 200 response from Anthropic's server (see https://docs.anthropic.com/en/api/messages):

{
  "content": [
    {
      "text": "Hi! My name is Claude.",
      "type": "text"
    }
  ],
  "id": "msg_013Zva2CMHLNnXjNJJKqJ2EF",
  "model": "claude-3-opus-20240229",
  "role": "assistant",
  "stop_reason": "end_turn",
  "stop_sequence": null,
  "type": "message",
  "usage": {
    "input_tokens": 10,
    "output_tokens": 25
  }
}
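
For reference, this usage object is already readable when calling the Anthropic SDK directly, without instructor; a minimal example (model name and prompt are placeholders):

import anthropic

client = anthropic.Anthropic()

message = client.messages.create(
    model="claude-3-opus-20240229",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello, Claude"}],
)

# usage is a first-class field on the Message object
print(message.usage.input_tokens, message.usage.output_tokens)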

Describe the solution you'd like
Instructor should patch Claude's API and surface the usage dictionary as part of the second element of the returned tuple, like so:

structured_output, completion = client.chat.completions.create_with_completion(...)
completion.usage  # should return usage, consisting of input and output tokens
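
For illustration, a minimal sketch of the requested behavior against the Anthropic client; it assumes instructor.from_anthropic and create_with_completion pass the raw Message through, with field names following Anthropic's response format above:

import instructor
from anthropic import Anthropic
from pydantic import BaseModel


class User(BaseModel):
    name: str
    age: int


client = instructor.from_anthropic(Anthropic())

user, completion = client.chat.completions.create_with_completion(
    model="claude-3-opus-20240229",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Extract Jason is 25 years old."}],
    response_model=User,
)

# With Anthropic, the raw completion would expose input_tokens/output_tokens
print(completion.usage.input_tokens, completion.usage.output_tokens)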

Elijas commented May 30, 2024

If you use Anthropic Claude through LiteLLM, the usage and cost get reported:

import instructor
from litellm import completion, completion_cost, cost_per_token
from pydantic import BaseModel


class User(BaseModel):
    name: str
    age: int


client = instructor.from_litellm(completion)

model = "claude-3-opus-20240229"

resp, raw_response = client.chat.completions.create_with_completion(
    model=model,
    max_tokens=1024,
    messages=[
        {
            "role": "user",
            "content": "Extract Jason is 25 years old.",
        }
    ],
    response_model=User,
)

assert isinstance(resp, User)
assert resp.name == "Jason"
assert resp.age == 25

# LiteLLM normalizes usage to OpenAI-style field names
usage = raw_response.usage
input_tokens = usage.prompt_tokens
output_tokens = usage.completion_tokens
total_tokens = usage.total_tokens

input_cost_usd, output_cost_usd = cost_per_token(
    model, prompt_tokens=input_tokens, completion_tokens=output_tokens
)
completion_cost_usd = completion_cost(completion_response=raw_response)
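
Note that LiteLLM returns OpenAI-format responses, so Anthropic's input_tokens and output_tokens surface here under the OpenAI-style names prompt_tokens and completion_tokens.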

ssonal (Contributor) commented Jun 1, 2024

> Describe the solution you'd like
> Instructor should patch Claude's API and surface the usage dictionary as part of the second element of the returned tuple, like so:

structured_output._raw_response.usage works, but it doesn't take retries into account.

@jxnl maybe we attach cumulative usage data here? It's currently getting lost while the response is processed.

model._raw_response = response
return model
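
As a sketch, cumulative tracking could fold each attempt's usage into a running total; accumulate_usage and _total_usage below are hypothetical names, not current instructor API, and Anthropic's Usage model is assumed:

from anthropic.types import Usage


def accumulate_usage(total: Usage, attempt: Usage) -> Usage:
    # Hypothetical helper: fold one retry attempt's usage into a running total
    return Usage(
        input_tokens=total.input_tokens + attempt.input_tokens,
        output_tokens=total.output_tokens + attempt.output_tokens,
    )


total = Usage(input_tokens=0, output_tokens=0)
attempts = [
    Usage(input_tokens=10, output_tokens=25),  # first attempt, failed validation
    Usage(input_tokens=12, output_tokens=30),  # retry that succeeded
]
for attempt_usage in attempts:
    total = accumulate_usage(total, attempt_usage)

# model._raw_response = response   # as today: only the final response
# model._total_usage = total       # hypothetical: cumulative across retries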


Elijas commented Jun 1, 2024

> structured_output._raw_response.usage works, but it doesn't take retries into account.
>
> @jxnl maybe we attach cumulative usage data here? It's currently getting lost while the response is processed.
>
> model._raw_response = response
> return model

related #715
