
add basic wasm support #122

Merged
merged 23 commits into 64bit:experiments on Nov 25, 2023

Conversation

ifsheldon
Contributor

This is based on #120 and #121.

To summarize:

  • Added two feature gates, enable_tokio and enable_backoff (see the Cargo.toml sketch after this list)
    • backoff is somewhat difficult to make work on wasm, so I decided to gate it behind a feature
  • A minor breaking API change: stream-related functions now return OpenAIEventStream. This should not matter as long as users only consume the stream in while let ... await loops, since OpenAIEventStream also implements Stream
  • Updated reqwest-eventsource
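
For illustration, a downstream Cargo.toml could select the features like this (a sketch only: the feature names come from this PR, the version is a placeholder, and whether enable_tokio/enable_backoff are in the default feature set is defined by the crate's Cargo.toml, not assumed here):

# native build: opting into the gated features explicitly
async-openai = { version = "*", features = ["enable_tokio", "enable_backoff"] }
# wasm build: defaults off, wasm feature on
# async-openai = { version = "*", default-features = false, features = ["wasm"] }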

This should close #102 unless file-related ops are wanted.

I tested the code with

cargo build --target wasm32-unknown-unknown --no-default-features --features wasm

and a web app.

The code is

// main.rs
use async_openai::types::{ChatCompletionRequestMessageArgs, CreateChatCompletionRequestArgs, Role};
use dioxus::prelude::*;
use futures::stream::StreamExt;
use anyhow::Result;

const API_BASE: &str = "...";
const API_KEY: &str = "...";
const API_VERSION: &str = "...";
const DEPLOYMENT_ID: &str = "...";

pub fn app(cx: Scope) -> Element {
    let ok_count = use_state(cx, || 0_usize);
    let err_count = use_state(cx, || 0_usize);
    let response_string: &UseRef<String> = use_ref(cx, String::new);
    let fetch_completion_chunks: &Coroutine<()> = use_coroutine(cx, |_rx| {
        let ok_count = ok_count.to_owned();
        let err_count = err_count.to_owned();
        let response_string = response_string.to_owned();
        async move {
            let config = async_openai::config::AzureConfig::new()
                .with_api_base(API_BASE)
                .with_api_key(API_KEY)
                .with_api_version(API_VERSION)
                .with_deployment_id(DEPLOYMENT_ID);
            let client = async_openai::Client::with_config(config);
            let request = CreateChatCompletionRequestArgs::default()
                .max_tokens(512u16)
                .model("gpt-3.5-turbo-0613")
                .messages([ChatCompletionRequestMessageArgs::default()
                    .role(Role::User)
                    .content("Hello!")
                    .build().unwrap()])
                .build().unwrap();
            let mut stream = client.chat().create_stream(request).await.unwrap();
            while let Some(chunk) = stream.next().await {
                match chunk {
                    Ok(response) => {
                        ok_count.modify(|x| *x + 1);
                        response_string.with_mut(|old| {
                            old.push('\n');
                            old.push_str(format!("{:?}", response).as_str());
                        })
                    }
                    Err(_e) => {
                        err_count.modify(|x| *x + 1);
                    }
                }
            }
        }
    });

    render! {
        div {
            p {
                "{response_string.read()}"
            }
            p {
                "ok_count: {ok_count.get()}"
            }
            p {
                "err_count: {err_count.get()}"
            }
        }
    }
}

fn dioxus_main() {
    dioxus_web::launch(app);
}

async fn async_openai_main() -> Result<()> {
    let config = async_openai::config::AzureConfig::new()
        .with_api_base(API_BASE)
        .with_api_key(API_KEY)
        .with_api_version(API_VERSION)
        .with_deployment_id(DEPLOYMENT_ID);
    let client = async_openai::Client::with_config(config);
    let request = CreateChatCompletionRequestArgs::default()
        .max_tokens(512u16)
        .model("gpt-3.5-turbo-0613")
        .messages([ChatCompletionRequestMessageArgs::default()
            .role(Role::User)
            .content("Hello!")
            .build()?])
        .build()?;
    let mut stream = client.chat().create_stream(request).await?;
    while let Some(chunk) = stream.next().await {
        println!("{:?}", chunk);
    }
    Ok(())
}

// for tokio testing
#[tokio::main]
async fn main() -> Result<()> {
    async_openai_main().await
}

// for wasm testing
// fn main() {
//     dioxus_main();
// }

Cargo.toml

[package]
name = "wasm_test"
version = "0.1.0"
edition = "2021"

# See more keys and their definitions at https://doc.rust-lang.org/cargo/reference/manifest.html

[dependencies]
dioxus = "~0.4"
dioxus-web = "~0.4"
futures = "0.3.28"
reqwest = { version = "0.11", features = ["json"] }
reqwest-eventsource = "0.5"
# for wasm
#async-openai = { path = "../async-openai/async-openai", default-features = false, features = ["wasm"] }
serde_json = "~1.0"
serde = { version = "1.0", features = ["derive"] }
anyhow = "~1.0"
# for native async-openai
async-openai = { path = "../async-openai/async-openai" }
tokio = { version = "1.32", features = ["full"] }

@64bit
Owner

64bit commented Oct 17, 2023

Hi @ifsheldon

Thank you for all of your good work!

This needs some work on documentation:

  1. A self-contained example, so it's easy for me to test (like the other examples), which also acts as documentation of the feature.
     Perhaps examples/azure-wasm or something with wasm in it - for the example that you have put in the description?
  2. A wasm feature description in the README, to show on crates.io and GitHub
  3. A brief wasm section in lib.rs below the Azure docs, to show on docs.rs for the crate

I also have a few questions:

  • Perhaps a bit of intro/docs on feature flags would help new folks too?
  • What's the rationale behind introducing OpenAIEventStream?
  • The original WASM request was for platforms like Cloudflare Workers - would this work? Perhaps add another self-contained example?
  • It is not clear which APIs this PR supports in wasm and which it doesn't, unless someone goes through the code.

Regarding your note about testing on OpenAI in the other PR - I'll be happy to help test changes and let you know how it goes - please expect some delay from my side though.

@ifsheldon
Contributor Author

ifsheldon commented Oct 17, 2023

For documentation, I will add more when I have some time.

Perhaps a bit intro/docs on feature flags would help new folks too?

Sure, but I don't know much about the feature flags that already exist on main, e.g. the TLS-related ones.

What's the rationale behind introducing OpenAIEventStream?

  1. To get rid of tokio::spawn and, in turn, of tokio itself. As in "add a struct to transform EventSource stream into OpenAI response stream" #121, what that code did was just transform one stream into another, but it did so using two tasks (a tx task and an rx task), which needs tokio::spawn. I can't just use stream filtering and mapping, mainly because of the if message.data == "[DONE]" branch, which stops the stream early.
  2. Choosing to return OpenAIEventStream is another story: simply put, I couldn't win the fight with rustc. You can try reverting 0126c6a, but you will see compile errors like

     Box::pin(OpenAIEventStream::new(event_source))
              ^^^^^^^^^^^^^^^^^^^^^^ `*mut u8` cannot be sent between threads safely

     which seem to appear from nowhere. As I understand it, you were trying to return an opaque dyn trait object that implements Stream, but I don't think this opaqueness is necessary as long as OpenAIEventStream also impls Stream.
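
To illustrate the idea, here is a minimal sketch (not the PR's actual OpenAIEventStream implementation) of a hand-rolled Stream adapter that terminates early on a "[DONE]" marker; because it is a concrete, nameable type, it can be returned directly, with no tasks, no channels, and no boxing into Pin<Box<dyn Stream + Send>>:

use std::pin::Pin;
use std::task::{Context, Poll};
use futures::stream::{self, Stream, StreamExt};

// Forwards items from `inner` until it sees "[DONE]", then ends the stream.
struct DoneTerminated<S> {
    inner: S,
    done: bool,
}

impl<S: Stream<Item = String> + Unpin> Stream for DoneTerminated<S> {
    type Item = String;

    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>> {
        let this = self.get_mut();
        if this.done {
            return Poll::Ready(None);
        }
        match Pin::new(&mut this.inner).poll_next(cx) {
            Poll::Ready(Some(msg)) if msg == "[DONE]" => {
                this.done = true;
                // Early termination - not expressible with plain filter/map.
                Poll::Ready(None)
            }
            other => other,
        }
    }
}

fn main() {
    futures::executor::block_on(async {
        let source = stream::iter(vec!["a".to_string(), "[DONE]".to_string(), "b".to_string()]);
        let mut s = DoneTerminated { inner: source, done: false };
        while let Some(item) = s.next().await {
            println!("{item}"); // prints only "a"
        }
    });
}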

The original WASM request was for platforms like Cloudflare Workers, would this work? Perhaps add another self contained example?

I think it will work nonetheless. I haven't tried these, though. As far as I know, wasm32-unknown-unknown is the minimum target in the wasm family, and it basically targets any web browser. Wasm runtimes or platforms other than web browsers support more functionality and targets like wasm32-wasi. For the differences between wasm32-unknown-unknown and wasm32-wasi, see https://users.rust-lang.org/t/wasm32-unknown-unknown-vs-wasm32-wasi/78325. I think as long as this compiles for wasm32-unknown-unknown, any wasm runtime or platform can work.

It is not clear which APIs are supported by this PR in wasm and which aren't unless someone goes through code.

Basically, anything related to files is not supported, except fine-tuning, since in the code it only needs strings and is not directly tied to files. I will write more in the documentation, though. Future work could be removing PathBuf from data structures so that any structs related to media hold bytes or byte streams; then web developers can use web technologies to upload media first (see the sketch below).
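
As a rough illustration of that direction (hypothetical struct names, not types from this PR):

use std::path::PathBuf;

// Today's shape (simplified): tied to the filesystem, which does not
// exist on wasm32-unknown-unknown.
pub struct PathBackedImageInput {
    pub path: PathBuf,
}

// A wasm-friendly shape: the caller supplies bytes it obtained however
// it likes (e.g. from a browser file picker), plus a filename for the
// multipart form field.
pub struct ByteBackedImageInput {
    pub filename: String,
    pub bytes: Vec<u8>,
}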

@ifsheldon
Contributor Author

@64bit I've added documentation and an example. Can you review these? Thanks!

@64bit
Owner

64bit commented Oct 22, 2023

Thank you for the updates. Please expect some delays, as previously mentioned. To provide you context: some older PRs and spec-update issues need attention, and given that this is a big new feature that requires testing, it might be a while before I get to this. In the meantime, would you mind closing your other PRs which are no longer relevant? Thank you for your patience.

@ifsheldon
Contributor Author

@64bit do you have any plan to merge this? Or any comments? I saw that new features got added recently, which makes this PR more intertwined and in need of more testing. If you have no intention of merging this, I can close it. Or I can try to follow up on the new features and see if they compile on wasm.

@64bit
Owner

64bit commented Nov 8, 2023

Hi @ifsheldon ,

The primary and bare-minimum purpose of this crate is to work with the OpenAI API - if it doesn't work with the API, this crate should not exist - that's why some PRs were merged before other open PRs.

That said, I'm sure the community would love to have wasm support, and I do too - however, I'm just limited by my bandwidth. This PR is a big feature that requires testing, and it doubles the surface area to test (wasm and non-wasm) without breaking existing features - all of which creates extra work for me for every single update, now and in the future.

As much as I want to merge your contributions, the above are the practical reasons that I cannot accept this PR anytime soon. I'm not sure when I could spend time on this, and I'm really sorry about that.

It would be nice to have your work in this PR published - so here are a few options:

  1. We release an alpha version from this branch - but you take full ownership of keeping the branch up to date with main; I'll just be publishing alpha, then beta and stable versions as they mature. [This would still be limited by my bandwidth.]
  2. Fork the async-openai crate and create a new wasm-only crate, something like async-openai-wasm - that way you get maximum flexibility for maintaining, updating, and publishing to crates.io.

Please let me know what you think - and if you have any alternative path forward, I'm open to hearing it.

Thank you

@ifsheldon
Contributor Author

OK. Either way I need to maintain the code, so I'd rather not distract developers by forking a new crate. So I think option 1 is good for me. I will try to catch up with OpenAI's new features soon. When that's done, you can just release an alpha.

@64bit
Owner

64bit commented Nov 11, 2023

Thank you for your willingness to take this on!

Let's ship it. I'm thinking of creating an 'experiments' branch to merge this into, along with other related future changes.

# Conflicts:
#	async-openai/Cargo.toml
#	async-openai/src/client.rs
#	async-openai/src/lib.rs
#	async-openai/src/types/impls.rs
#	async-openai/src/types/types.rs
@ifsheldon
Contributor Author

ifsheldon commented Nov 19, 2023

@64bit Great! I've made a few changes to bring the features in this branch in sync with main. My example has been updated to support OpenAI APIs, and I've tested it with both Azure OpenAI and OpenAI.

The next step may be to separate file paths from file binaries in the Input structs. For example, ImageInput has a path field, which is not nice for wasm.

@64bit 64bit changed the base branch from main to experiments November 25, 2023 03:38
@64bit 64bit merged commit 92650e4 into 64bit:experiments Nov 25, 2023