These changes add handling of an additional parameter "--parallel-test" #1527

bicarbon8 · 2014-05-13T12:14:43Z

These changes add handling of an additional parameter "--parallel-test" that takes a number as input. This number will then be used to limit execution of examples to the specified number of parallel threads. To accomplish this the ExampleGroups each are started in a sub-thread which can then start up to the specified number of example threads running in parallel. A global maximum is used to prevent going over the specified number and execution of additional threads will wait until an available thread is free for cases where the maximum number of parallel threads are in use.

Additionally, these changes ensure that Before / After Suite and Before / After All calls are executed only once per expected grouping, but that Before / After Each calls are executed before and after each example as part of the parallel thread.

The advantage of this change is that it fully supports all platforms supported by rspec (unlike parallel_tests gem which does not work correctly on all Windows systems) and the output remains serially reported so you avoid the need to stitch together test reports at the completion of testing (as can be required with solutions like parallel_tests and prspec gems where parallel execution is actually split between multiple instances of rspec).

This is also an attempt to partially cover the request at: #1254

…t" that takes a number as input. This number will then be used to limit execution of examples to the specified number of parallel threads. To accomplish this the ExampleGroups each are started in a sub-thread which can then start up to the specified number of example threads running in parallel. A global maximum is used to prevent going over the specified number and execution of additional threads will wait until an available thread is free for cases where the maximum number of parallel threads are in use. Additionally, these changes ensure that Before / After Suite and Before / After All calls are executed only once per expected grouping, but that Before / After Each calls are executed before and after each example as part of the parallel thread. The advantage of this change is that it fully supports all platforms supported by rspec (unlike parallel_tests gem which does not work correctly on all Windows systems) and the output remains serially reported so you avoid the need to stitch together test reports at the completion of testing (as can be required with solutions like parallel_tests and prspec gems where parallel execution is actually split between multiple instances of rspec).

JonRowe · 2014-05-13T12:58:36Z

lib/rspec/core.rb

@@ -1,4 +1,6 @@
 $_rspec_core_load_started_at = Time.now
+require 'thread'


If possible, this needs delaying until someone actively uses the feature, reason being is we try very hard not to pollute a users code base with std library stuff, which this would.

I added this here because the threaded execution, in this proposed implementation, becomes the default, but with a maximum threadcount of 1. I can certainly move it to elsewhere, but it might then need to go in the runner.rb file instead which is still pretty core.

I mean I'd like it contextually loaded when someone uses the feature, if they don't use the feature I really don't want this required.

I'll have to create a completely separate execution path to support contextual loading which will lead to greater maintenance cost since there will be what is effectively a run and run_parallel method instead of handling everything in the run method. I'll commit the changes in as surgical a fix as possible in case you decide that the approach is not as good.

JonRowe · 2014-05-13T13:10:04Z

Good start, I've left some feedback, you'll notice this completely breaks the travis build, that needs fixing.

bicarbon8 · 2014-05-13T13:41:39Z

Thanks for all the great feedback. As to the travis build, is there any way to get better information on why it failed? I tried looking around, but didn't see any error logging that could explain why it failed (looks like a timeout)

myronmarston · 2014-05-13T16:51:42Z

I tried looking around, but didn't see any error logging that could explain why it failed (looks like a timeout)

The time out is why it failed. The test suite reached a point where it didn't emit any output for 10 minutes and when that happens travis kills the process and fails the build as it usually indicates dead lock (or something similar).

myronmarston · 2014-05-13T16:52:14Z

I definitely want to review this at some point but probably won't get around to it for a bit.

cupakromer · 2014-05-13T19:46:47Z

lib/rspec/core/example_group.rb

+        filtered_examples.map do |example|
+          example.succeeded?
+        end.all?
+      end


all? can take a block. Since succeeded? is simply an attr and has no side-effects, why not leverage that:

filtered_examples.all?{ |example| example.succeeded? }

Removing file-based mutex and replacing with Mutex class Fixing multiline blocks to use 'do' instead of '{' Removing extra comments Switched environment to POSIX git commits Fixed spacing on default method values '=' became ' = ' Removed changes to Example class in favour of using metadata for example success Switched to .all? {...} from .map {...}.all? Corrected method parameter reference from [int] to [Integer]

…on Pull Request #1527 feedback Removes need to pass 'num_threads' directly from options into execution methods and then uses config values to specify the thread max limit

… high up the thread chain resulting in less optimal thread utilization

JonRowe · 2014-05-14T22:29:48Z

lib/rspec/core/example_thread_runner.rb

+      # Method will check global utilization of threads and if that number is
+      # at or over the allocated maximum it will wait until a thread is available
+      def wait_for_available_thread
+        # wait for available thread if we've reached our global limit


Remove comment.

…through the calls fixing line endings to use POSIX style (hopefully) Removing inline comments from code Reverted runner.rb self.run method back to original state Modified how the result of testing is collected in runner.rb run_specs_parallel so now it will only return an error code if there is an exception in executing the example or its before or after blocks

bicarbon8 · 2014-05-15T09:45:26Z

Thanks again for the feedback. https://github.com/bicarbon8/rspec-core/commit/708e869b3d019901be8cc0fbcb9b680b60add27c attempts to address the latest round of comments with the exception of changing the Thread.start { ... } multiline usage to a 'do ... end' block. I cannot change this due to the design of the Thread object which returns

rspec-core-3.0.0.beta2/lib/rspec/core/example_group_thread_runner.rb:16:in `initialize': must be called with a block (ThreadError)

unless the curly braces are used. Hopefully this is ok for the two instances where this is used.

JonRowe · 2014-05-15T09:47:48Z

Oh, you'll need to separate the lines so the blocks go to the right method.

bicarbon8 · 2014-05-15T10:12:50Z

Hi @JonRowe ,

I'm not sure I get what you mean there, so if you have a second to explain a bit more I'd appreciate it.

Also, I just realized that I haven't removed one of the globals so I still need to take care of that.

EDIT
removed other global in https://github.com/bicarbon8/rspec-core/commit/28a18e5176f733b5f3038ba6c09ad3a918c4c482

…ow passes the variable by reference to the areas where needed.

…overage metrics

…d of build machine

…ing in method call and undefined variable in cucumber test

…entation coverage

bicarbon8 · 2014-05-15T17:24:49Z

Looks like I finally managed to get some passing builds. For those that didn't pass I'm seeing the following error:

/home/travis/build/rspec/rspec-core/lib/rspec/core/configuration.rb:1052:in `load': /home/travis/build/rspec/rspec-core/spec/command_line/parallel_spec.rb:83: undefined (?...) sequence: /Finished in (?<match>.*) second/ (SyntaxError)

It looks like there might be a problem with the version of Ruby's understanding of how I'm extracting the number of seconds the tests took. I would appreciate some help in finding a more cross-version compatible way of doing the same thing:

def get_seconds(output_str)
  return output_str[/Finished in (?<match>.*) second/, "match"]
end

Thanks in advance for your help!

cupakromer · 2014-05-15T17:55:15Z

Ruby 1.8.7 does not support named matches. You'll need to refer to the older docs for proper usage: http://ruby-doc.org/core-1.8.7/String.html#method-i-5B-5D

Looks like you have to pull the match position out of the matcher data.

output_str[/Finished in (.*) second/, 1]

…f ruby

bicarbon8 · 2014-05-15T20:54:52Z

thanks for the help @cupakromer

I thought that would be the last bit to get a passing build, but it appears there might still be a problem with jruby-18mode as there was a timeout in the build 😦

JonRowe · 2014-05-15T22:38:38Z

I'll restart it for you, see if it was transitory, but we do need this work on JRuby as JRuby and RBX will benefit the most.

myronmarston · 2014-05-16T04:26:44Z

lib/rspec/core/example_group.rb

+          before_context_ivars.clear
+          reporter.example_group_finished(self)
+        end
+      end


This looks like it has a lot in common with run. We should look for a way to extract the common logic -- otherwise it'll add a maintenance burden to keep them in sync. Same with run_examples_parallel.

Actually, I wonder if we can come up with an injectable run strategy abstraction that can have parallel and serial implementations?

changed "examplegroup" variable to be "example_group" modified wait_for_completion methods in example_thread_runner.rb and example_group_thread_runner.rb to join to the sub-threads instead of polling modified wait_for_available_thead method in example_thread_runner.rb to only sleep for 0.1 seconds instead of 1 second to reduce wait overhead while polling for an available thread to run within changed doublequotes to singlequotes in "spec/command_line/parallel_spec.rb" to avoid the need for escaping

xaviershay · 2014-05-16T15:00:20Z

Sorry I'm so late on this, looks like there has already been a lot of feedback.

I feel like this is "tacking on" the feature, rather than integrating it properly with RSpec, and that we need to take a radically different approach to do this in a way that we can support. Namely, we need to make evaluation a first-class concept (this type of thing: https://github.com/xaviershay/xspec/blob/master/lib/xspec/evaluators.rb ), hopefully expose that publicly, then start with parallel running in a gem to test it out before bringing it in to core.

There are also a large number of things in RSpec known to be Thread-unsafe (particularly in mock setup/teardown) that need to be audited and thoroughly tested along with any parallel runner.

We also need to think about thread setup: this won't work with acceptance tests, for example, because each thread needs its own database (and possibly other dependencies as well). Even if we don't tackle this right away, we need to not paint ourselves into a corner so that it can potentially be solved later.

I'm interested in exploring this further post 3.0 release.

myronmarston · 2014-05-17T18:03:20Z

I agree with @xaviershay -- I want this feature but only if we do it properly, which is going to be a lot more work than what's here, and will need to involve a full audit to deal with any thread safety issues.

Closing.

bicarbon8 · 2014-05-19T12:59:16Z

While obviously not the result I would have hoped for, I was able to take inspiration from @xaviershay and create a rspec-parallel gem that bolts itself to rspec in the same way that these changes did. I've managed to get the 2.14 version of the gem running successfully and have the basics in place for the 3.0.0.Beta2 version, but haven't yet tested it. Have a look if you're interested:
https://github.com/bicarbon8/rspec-parallel
or 'gem install rspec-parallel'

Thanks again for all the good comments and feedback!

JonRowe reviewed May 13, 2014
View reviewed changes

cupakromer reviewed May 13, 2014
View reviewed changes

bicarbon8 added 3 commits May 14, 2014 09:57

Change to how maximum number of threads is passed to execution based …

65d4d62

…on Pull Request #1527 feedback Removes need to pass 'num_threads' directly from options into execution methods and then uses config values to specify the thread max limit

Moving mutex down to smallest possible scope. Prior placement was too…

148e081

… high up the thread chain resulting in less optimal thread utilization

JonRowe reviewed May 14, 2014
View reviewed changes

bicarbon8 added 5 commits May 15, 2014 11:21

removes additional global missed in last commit ($used_threads) and n…

28a18e5

…ow passes the variable by reference to the areas where needed.

Adding rspec tests for parallel execution to help increase the code c…

3ace928

…overage metrics

adding a 1 second buffer to the expected duration to account for spee…

54b876e

…d of build machine

correcting additional test failures pertaining to default value handl…

060cb34

…ing in method call and undefined variable in cucumber test

Adding YARD documentation to the new class files to ensure 100% docum…

9603ae4

…entation coverage

modifying regex string matcher to be compatible with older versions o…

5145ed9

…f ruby

myronmarston reviewed May 16, 2014
View reviewed changes

myronmarston closed this May 17, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

These changes add handling of an additional parameter "--parallel-test" #1527

These changes add handling of an additional parameter "--parallel-test" #1527

bicarbon8 commented May 13, 2014

JonRowe May 13, 2014

bicarbon8 May 13, 2014

JonRowe May 13, 2014

bicarbon8 May 14, 2014

JonRowe commented May 13, 2014

bicarbon8 commented May 13, 2014

myronmarston commented May 13, 2014

myronmarston commented May 13, 2014

cupakromer May 13, 2014

JonRowe May 13, 2014

JonRowe May 14, 2014

bicarbon8 commented May 15, 2014

JonRowe commented May 15, 2014

bicarbon8 commented May 15, 2014

bicarbon8 commented May 15, 2014

cupakromer commented May 15, 2014

bicarbon8 commented May 15, 2014

JonRowe commented May 15, 2014

myronmarston May 16, 2014

xaviershay commented May 16, 2014

myronmarston commented May 17, 2014

bicarbon8 commented May 19, 2014

		@@ -1,4 +1,6 @@
		$_rspec_core_load_started_at = Time.now
		require 'thread'

These changes add handling of an additional parameter "--parallel-test" #1527

These changes add handling of an additional parameter "--parallel-test" #1527

Conversation

bicarbon8 commented May 13, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

JonRowe commented May 13, 2014

bicarbon8 commented May 13, 2014

myronmarston commented May 13, 2014

myronmarston commented May 13, 2014

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bicarbon8 commented May 15, 2014

JonRowe commented May 15, 2014

bicarbon8 commented May 15, 2014

bicarbon8 commented May 15, 2014

cupakromer commented May 15, 2014

bicarbon8 commented May 15, 2014

JonRowe commented May 15, 2014

Choose a reason for hiding this comment

xaviershay commented May 16, 2014

myronmarston commented May 17, 2014

bicarbon8 commented May 19, 2014