Performance on large wsdls #10

Open
phillbaker opened this issue Dec 5, 2018 · 4 comments

@phillbaker
Member

This was originally opened as https://bitbucket.org/jurko/suds/issues/9/profiling-suds; it seems that suds' performance can be improved when loading large WSDLs.

Some examples of work to this end are:

@chris-griffin

https://github.com/liboz/suds-lxml as well

@ovnicraft
Collaborator

Among all the repos mentioned, what's the best option?

@phillbaker
Member Author

@ovnicraft I pulled several of these commits into a branch here: https://github.com/suds-community/suds/tree/perf-wsdls

However, during testing I didn't notice any dramatic improvements - would love some help evaluating and testing.
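
For anyone who wants to help: a quick way to compare the branch against master is to time client construction directly against a large WSDL, along these lines (the WSDL path is just a placeholder):

import time
from suds.client import Client

url = 'file:///path/to/large.wsdl'  # placeholder: any sufficiently large WSDL

start = time.perf_counter()
c = Client(url, cachingpolicy=0)
print('client boot took %.2fs' % (time.perf_counter() - start))

Running this against both branches in a fresh interpreter (and, I believe, with cache=NoCache() from suds.cache if you want every run to stay cold) should make any difference in client boot time visible.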

@phillbaker
Member Author

To add a bit more context, most of those commits seem related to performance in the request/response cycle. The main issue we're seeing is on client boot.

There isn't much documentation on the differences between the cachingpolicy values (introduced in b3d6d18); as far as I can tell, 0 caches the fetched XML documents while 1 pickles the entire WSDL object graph. Here's some basic tracing of client boot time (a rough sketch of the instrumentation follows the traces below):

  • With cachingpolicy=0
c = Client(url,cachingpolicy=0)  # first load
document.loaded 4.601478576660156e-05
sax.parse 1.555516004562378
documents.parsed 6.794929504394531e-05
open_imports 3.0994415283203125e-06
resolve 0.0265350341796875
document.loaded 5.2928924560546875e-05
sax.parse 0.013490915298461914
documents.parsed 4.1961669921875e-05
document.loaded 4.100799560546875e-05
sax.parse 4.854841947555542
documents.parsed 7.009506225585938e-05
document.loaded 4.38690185546875e-05
sax.parse 3.8335940837860107
documents.parsed 7.796287536621094e-05
document.loaded 6.008148193359375e-05
sax.parse 0.031036853790283203
documents.parsed 5.0067901611328125e-05
build_schema 28.749293088912964
set_wrapped 0.27320289611816406
add_methods 0.09953713417053223
self.fn 34.003334045410156
Factory 5.793571472167969e-05
ServiceSelector 1.0967254638671875e-05
ServiceDefinition 0.33492493629455566

c = Client(url,cachingpolicy=0)  # warmed cache
documents.parsed 6.985664367675781e-05
open_imports 7.152557373046875e-06
resolve 0.047647953033447266
documents.parsed 3.910064697265625e-05
documents.parsed 4.1961669921875e-05
documents.parsed 3.695487976074219e-05
documents.parsed 2.9087066650390625e-05
build_schema 9.905397891998291
set_wrapped 0.1327660083770752
add_methods 0.039650917053222656
self.fn 13.933005094528198
Factory 3.600120544433594e-05
ServiceSelector 6.198883056640625e-06
ServiceDefinition 0.2581620216369629
  • With cachingpolicy=1
c = SudsClient(url,cachingpolicy=1)  # first load
document.loaded 0.0001628398895263672
sax.parse 1.9437551498413086
documents.parsed 9.799003601074219e-05
open_imports 2.86102294921875e-06
resolve 0.024981021881103516
document.loaded 4.696846008300781e-05
sax.parse 0.012939929962158203
documents.parsed 2.6941299438476562e-05
document.loaded 3.886222839355469e-05
sax.parse 3.364743947982788
documents.parsed 5.0067901611328125e-05
document.loaded 5.698204040527344e-05
sax.parse 0.9885931015014648
documents.parsed 4.291534423828125e-05
document.loaded 5.3882598876953125e-05
sax.parse 0.013521909713745117
documents.parsed 2.7894973754882812e-05
build_schema 14.720332860946655
set_wrapped 0.13639307022094727
add_methods 0.03350090980529785
self.fn 23.030166149139404
Factory 5.602836608886719e-05
ServiceSelector 5.0067901611328125e-06
ServiceDefinition 0.21198701858520508

c = Client(url,cachingpolicy=1)  # warmed cache
cache.get 8.078610181808472
Factory 2.7894973754882812e-05
ServiceSelector 5.0067901611328125e-06
ServiceDefinition 0.24859380722045898
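
The per-phase numbers above came from some ad-hoc timing of internal calls. A rough sketch of one way to get that kind of output is below; the patch point is an assumption, the idea is just a timing wrapper around whatever call you want to measure:

import functools
import time

def timed(label, fn):
    # print how long each call to fn takes
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        try:
            return fn(*args, **kwargs)
        finally:
            print(label, time.perf_counter() - start)
    return wrapper

# example: time every SAX parse during client boot
from suds.sax.parser import Parser
Parser.parse = timed('sax.parse', Parser.parse)

from suds.client import Client
c = Client('file:///path/to/large.wsdl', cachingpolicy=0)  # placeholder WSDL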

What's surprising is that bumping the pickle protocol version doesn't seem to have an effect. It seems to be set at Python 2's max value:

protocol = 2
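
For anyone who wants to reproduce this, a sketch of overriding the protocol, assuming the protocol = 2 above is the class attribute on the default ObjectCache (the cache location and WSDL path are placeholders):

import pickle
from suds.cache import ObjectCache
from suds.client import Client

class HighProtocolCache(ObjectCache):
    # override the pickle protocol used when writing the cached object graph
    protocol = pickle.HIGHEST_PROTOCOL

c = Client('file:///path/to/large.wsdl',  # placeholder WSDL
           cachingpolicy=1,
           cache=HighProtocolCache('/tmp/suds-cache'))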

On Python 3, using pickle.HIGHEST_PROTOCOL with a warmed cache, this is an average load:

c = Client(url, cachingpolicy=1)
cache.get 8.570900917053223
Factory 3.1948089599609375e-05
ServiceSelector 6.9141387939453125e-06
ServiceDefinition 0.23836612701416016

However, disabling the garbage collector during the unpickling did have a positive effect, dramatically reducing load time, along the lines of:

import gc

# disable the garbage collector while the cached object graph is unpickled;
# allocating that many objects at once otherwise triggers repeated gc passes
gc.disable()
try:
    cache.get(...)
finally:
    # re-enable the garbage collector
    gc.enable()
