
Add Throttle #65

Draft: wants to merge 6 commits into base: master
Conversation

programatik29
Contributor

Adds `Throttle`, which can slow down outgoing data.

Contributor
@neoeinstein left a comment

Added some thoughts.

```rust
/// Will panic if milliseconds in `duration` is larger than `u32::MAX`.
pub fn new(body: B, duration: Duration, bytes: u32) -> Self {
    let bytes = f64::from(bytes);
    let duration = f64::from(u32::try_from(duration.as_millis()).expect("duration too large"));
```
Contributor

This could instead use `.as_secs_f64()`. This is a units change, but by using the same function below, you can keep things aligned.

Contributor Author

Makes sense.

```rust
State::Waiting(sleep, time) => match sleep.as_mut().poll(cx) {
    Poll::Ready(()) => {
        let byte_rate = *this.byte_rate;
        let mut elapsed = to_f64(time.elapsed().as_millis());
```
Contributor
`.as_secs_f64()`

```rust
    }
    Poll::Pending => return Poll::Pending,
},
State::Ready(time) => match this.inner.as_mut().poll_data(cx) {
```
Contributor

Of note, if we get a really large, single chunk, then no real throttling of data is done. Instead, this does throttling of chunk pulls, which may be pretty coarse in practice, as the max buffer size in hyper for HTTP/1 is 408 kiB. With a `Full` inner response body, `Throttle` would send everything as a single chunk without any throttling, probably not what a user of `Throttle` in a response would expect.

To implement throttling regardless of chunk size, you may need to hold the underlying data as a buffer to enable re-chunking the data on the way through, potentially avoiding floating-point rate calculations.

One way would be: call poll_data(), split off up to quota and send (or all if below quota), if quota reached, then save away remaining bytes and halt sending until the next time horizon. On reaching next time horizon, send quota out of remaining bytes. If bytes are exhausted, poll data again and repeat. There’s some decision to be made here between number and size of chunks and the ability to track the requested throttle rate. With large time buckets, you can keep chunks relatively large, but will end up with highly-variable instantaneous throughput. With small time buckets, you may have a smoother throughput profile, but have more overhead in the number of chunks being sent downstream.

Thoughts?

Contributor Author

To implement throttling regardless of chunk size, you may need to hold the underlying data as a buffer to enable re-chunking the data on the way through, potentially avoiding floating-point rate calculations.

I think this buffering cost should be optional to users. Maybe a `Buffer` body utility can be added.

Instead, this does throttling of chunk pulls, which may be pretty coarse in practice, as the max buffer size in hyper for HTTP/1 is 408 kiB.

Can't really get around that except by documenting this setting and having users set it.

Contributor

I think this buffering cost should be optional to users. Maybe a `Buffer` body utility can be added.

In practice, as the chunks are `Bytes`, the split operation is cheap, zero-copy, and doesn't actually require allocating any distinct memory.

Contributor Author

I didn't know `bytes::buf::Buf::copy_to_bytes` was optimized for `Bytes`.

One way would be: call poll_data(), split off up to quota and send (or all if below quota), if quota reached, then save away remaining bytes and halt sending until the next time horizon. On reaching next time horizon, send quota out of remaining bytes. If bytes are exhausted, poll data again and repeat. There’s some decision to be made here between number and size of chunks and the ability to track the requested throttle rate. With large time buckets, you can keep chunks relatively large, but will end up with highly-variable instantaneous throughput. With small time buckets, you may have a smoother throughput profile, but have more overhead in the number of chunks being sent downstream.

I will work on this.

```rust
use std::{convert::Infallible, time::Duration};
use tokio::time::Instant;

#[tokio::test(start_paused = true)]
```
Contributor

I do so love the auto-advancing clock for tokio testing.
