docs: tracing and configuration

depends on bentoml#3052

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
aarnphm · Oct 5, 2022 · 9d5a5a0
1 parent 1533da5
commit 9d5a5a0

Showing 7 changed files with 531 additions and 145 deletions.
diff --git a/docs/source/_static/img/jaeger-ui.png b/docs/source/_static/img/jaeger-ui.png
diff --git a/docs/source/concepts/runner.rst b/docs/source/concepts/runner.rst
@@ -2,6 +2,11 @@
 Using Runners
 =============
 
+*time expected: 15 minutes*
+
+This page explains the concept of Runners and demonstrates their role within the
+BentoML architecture.
+
 What is Runner?
 ---------------
 
@@ -56,6 +61,10 @@ methods.
 Custom Runner
 -------------
 
+For more advanced use cases, BentoML also allows users to define their own Runner
+classes. This is useful when the pre-built Runners do not meet the requirements, or
+when the user wants to implement a Runner for a new ML framework.
+
 Creating a Runnable
 ^^^^^^^^^^^^^^^^^^^
 
@@ -327,6 +336,7 @@ Runner Configuration
 --------------------
 
 Runner behaviors and resource allocation can be specified via BentoML :ref:`configuration <guides/configuration:Configuring BentoML>`.
+
 Runners can be configured either individually or in aggregate under the ``runners`` configuration key. To configure a specific runner, specify its name
 under the ``runners`` configuration key. Otherwise, the configuration will be applied to all runners. The examples below demonstrate both
 the configuration for all runners in aggregate and for an individual runner (``iris_clf``).
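The aggregate-versus-individual behavior described in the hunk above can be sketched in a few lines of Python. This is a minimal illustration and not BentoML's implementation: `AGGREGATE_KEYS`, `deep_merge`, and `resolve_runner_config` are hypothetical names, and the rule that a per-runner entry overrides the aggregate value key by key is an assumption made for the example.

```python
from copy import deepcopy

# Hypothetical: keys treated as aggregate settings; any other key under
# ``runners`` is assumed to be a runner name.
AGGREGATE_KEYS = {"batching", "resources", "timeout"}


def deep_merge(base, override):
    """Return a new dict in which ``override`` wins over ``base``, recursively."""
    merged = deepcopy(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def resolve_runner_config(runners_cfg, runner_name):
    """Combine aggregate settings with the entry for ``runner_name``, if any."""
    aggregate = {k: v for k, v in runners_cfg.items() if k in AGGREGATE_KEYS}
    specific = runners_cfg.get(runner_name, {})
    return deep_merge(aggregate, specific)


# Parsed equivalent of a ``runners`` section in configuration.yml.
runners_cfg = {
    "batching": {"enabled": True, "max_batch_size": 100, "max_latency_ms": 500},
    "iris_clf": {"batching": {"max_batch_size": 32}},
}

iris = resolve_runner_config(runners_cfg, "iris_clf")
other = resolve_runner_config(runners_cfg, "other_runner")
```

Here `iris` keeps the aggregate `enabled` and `max_latency_ms` values but uses its own `max_batch_size`, while `other` receives the aggregate settings unchanged.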
@@ -340,29 +350,29 @@ To explicitly disable or control adaptive batching behaviors at runtime, configu
 .. tab-set::
 
     .. tab-item:: All Runners
-        :sync: all_runners
+       :sync: all_runners
+
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
+
+          runners:
+            batching:
+              enabled: true
+              max_batch_size: 100
+              max_latency_ms: 500
 
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
-
-            runners:
-                batching:
-                    enabled: true
-                    max_batch_size: 100
-                    max_latency_ms: 500
-
     .. tab-item:: Individual Runner
         :sync: individual_runner
-        
+
         .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+           :caption: ⚙️ `configuration.yml`
 
-            runners:
-                iris_clf:
-                    batching:
-                        enabled: true
-                        max_batch_size: 100
-                        max_latency_ms: 500
+           runners:
+             iris_clf:
+               batching:
+                 enabled: true
+                 max_batch_size: 100
+                 max_latency_ms: 500
 
 Resource Allocation
 ^^^^^^^^^^^^^^^^^^^
@@ -373,53 +383,53 @@ through configuration, with a `float` value for ``cpu`` and an `int` value for `
 .. tab-set::
 
     .. tab-item:: All Runners
-        :sync: all_runners
+       :sync: all_runners
 
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
+
+          runners:
+            resources:
+              cpu: 0.5
+              nvidia.com/gpu: 1
 
-            runners:
-                resources:
-                    cpu: 0.5
-                    nvidia.com/gpu: 1
-
     .. tab-item:: Individual Runner
         :sync: individual_runner
-        
+
         .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+           :caption: ⚙️ `configuration.yml`
 
-            runners:
-                iris_clf:
-                    resources:
-                        cpu: 0.5
-                        nvidia.com/gpu: 1
+           runners:
+             iris_clf:
+               resources:
+                 cpu: 0.5
+                 nvidia.com/gpu: 1
 
 Alternatively, a runner can be mapped to a specific set of GPUs. To specify GPU mapping, instead of defining an `integer` value, a list of device IDs
 can be specified for the ``nvidia.com/gpu`` key. For example, the following configuration maps the configured runners to GPU devices 2 and 4.
 
 .. tab-set::
 
     .. tab-item:: All Runners
-        :sync: all_runners
+       :sync: all_runners
 
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
+
+          runners:
+            resources:
+              nvidia.com/gpu: [2, 4]
 
-            runners:
-                resources:
-                    nvidia.com/gpu: [2, 4]
-
     .. tab-item:: Individual Runner
-        :sync: individual_runner
-        
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+       :sync: individual_runner
+
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
 
-            runners:
-                iris_clf:
-                    resources:
-                        nvidia.com/gpu: [2, 4]
+          runners:
+            iris_clf:
+              resources:
+                nvidia.com/gpu: [2, 4]
 
 Timeout
 ^^^^^^^
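The resource allocation hunk above accepts two shapes for ``nvidia.com/gpu``: an integer count or a list of device IDs. The sketch below shows one way such a value could be validated and normalized. `normalize_resources` and the `gpu_count` / `gpu_ids` output keys are hypothetical and exist only for this illustration; how BentoML itself interprets these values is not shown in this diff.

```python
def normalize_resources(resources):
    """Validate a ``resources`` mapping and return a normalized copy.

    Hypothetical helper for illustration; not part of BentoML.
    """
    normalized = {}

    cpu = resources.get("cpu")
    if cpu is not None:
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(cpu, bool) or not isinstance(cpu, (int, float)) or cpu <= 0:
            raise ValueError(f"cpu must be a positive number, got {cpu!r}")
        normalized["cpu"] = float(cpu)

    gpu = resources.get("nvidia.com/gpu")
    if gpu is not None:
        if isinstance(gpu, bool):
            raise ValueError("nvidia.com/gpu must be an int or a list of ints")
        if isinstance(gpu, int):
            # Count form: only the number of GPUs is known, not which ones.
            if gpu < 0:
                raise ValueError("GPU count must not be negative")
            normalized["gpu_count"] = gpu
            normalized["gpu_ids"] = None
        elif isinstance(gpu, list) and all(
            isinstance(i, int) and not isinstance(i, bool) and i >= 0 for i in gpu
        ):
            # Mapping form: explicit device IDs, which must be unique.
            if len(set(gpu)) != len(gpu):
                raise ValueError("GPU device IDs must be unique")
            normalized["gpu_count"] = len(gpu)
            normalized["gpu_ids"] = list(gpu)
        else:
            raise ValueError("nvidia.com/gpu must be an int or a list of ints")

    return normalized


count_form = normalize_resources({"cpu": 0.5, "nvidia.com/gpu": 1})
mapping_form = normalize_resources({"nvidia.com/gpu": [2, 4]})
```

Keeping `gpu_ids` as `None` in the count form avoids guessing which physical devices a bare count refers to.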
@@ -429,23 +439,23 @@ Runner timeout defines the amount of time in seconds to wait before calls a runn
 .. tab-set::
 
     .. tab-item:: All Runners
-        :sync: all_runners
+       :sync: all_runners
 
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
+
+          runners:
+            timeout: 60
 
-            runners:
-                timeout: 60
-
     .. tab-item:: Individual Runner
-        :sync: individual_runner
-        
-        .. code-block:: yaml
-	    :caption: ⚙️ `configuration.yml`
+       :sync: individual_runner
+
+       .. code-block:: yaml
+          :caption: ⚙️ `configuration.yml`
 
-            runners:
-                iris_clf:
-                    timeout: 60
+          runners:
+            iris_clf:
+              timeout: 60
 
 Access Logging
 ^^^^^^^^^^^^^^