Fixed #34 Customizing scipy's oaconvolve by NimaSarajpoor · Pull Request #35 · stumpy-dev/sliding_dot_product

NimaSarajpoor · 2026-01-08T15:35:08Z

This PR is to address #34.

gitnotebooks · 2026-01-08T15:35:11Z

Review these changes at https://app.gitnotebooks.com/stumpy-dev/sliding_dot_product/pull/35

NimaSarajpoor · 2026-01-08T15:56:52Z

./timing.py -timeout 1.0 -pmin 7 -pmax 24 pyfftw pocketfft_r2c_c2r scipy_oaconvolve challenger > timing.csv

# in timing.py, I change timeout to 5.0 when `len(T) >= 2^20`

The challenger is the customized version of scipy's oaconvolve.

Observations:

The challenger outperforms scipy's oaconvolve
For len(Q) <= 2 ^16 (and len(Q)>= 2^7), challenger outperforms the others for the most part.
For len(Q) > 2^16, pocketfft outperforms the others for the most part.

For me, the important one is the first bullet point. Of the four optimization opportunities mentioned in this comment, I've addressed 1, 2, and 3 in this PR. The last item, which is about adjusting the number of multiplication for real-valued arrays, can be explored next.

seanlaw · 2026-01-09T01:32:35Z

As a gentle reminder, even if we can do things faster, we will never (??) remove the public scipy convolution functions from STUMPY because they should be our last resort fallback (in case the alternatives, that may use private functions, raise an error). Does that make sense?

NimaSarajpoor · 2026-01-09T02:19:50Z

Good reminder. It makes sense!!

NimaSarajpoor · 2026-01-09T18:29:56Z

+
+
+def test_oaconvolve_sdp_blocksize():
+    from sdp.challenger_sdp import sliding_dot_product


This line needs to be modified if, at a later time, we decide to move the proposal to a new file (module).

NimaSarajpoor · 2026-01-09T20:15:08Z

./timing.py -timeout 1.0 -pmin 7 -pmax 24 pyfftw pocketfft_r2c_c2r scipy_oaconvolve challenger > timing.csv

# in timing.py, I change timeout to 5.0 when `len(T) >= 2^20`

@seanlaw
"Challenger" seems to be the winner for most cases, and I think it is worth it to include it. What do you think? Also, can you please review the script? I've made major changes. The private objects are now only r2c and c2r. IMO, the script looks cleaner now.

seanlaw

@NimaSarajpoor I've left some comments but would still like another pass after you've cleaned things up further

I do agree that, for the most part, things look clean. I think it still lacks clarity as to what is happening or why the logic is coded in this way

NimaSarajpoor · 2026-05-17T18:55:24Z

+    if conv_block_size is None:
+        # Find optimal block_size based on m and n
+        if m >= n / 2:
+            conv_block_size = n  # i.e. no blocking


Not clear why this condition is needed ?

NimaSarajpoor · 2026-05-20T00:48:12Z


-def setup(Q, T):
-    return
+def _compute_block_size(m, n, conv_block_size=None):


According to my exploration, replacing this with scipy's function _calc_oa_lens results in a considerable performance hit. I am passing real=True in next_fast_len(math.ceil(opt_size), real=True) in the function here; however, the scipy's function (internally) uses the default real=False.

I've checked the performance of the challenger with real=False to see its impact on the performance. As shown in the figure below, challenger (in which real=True is passed to next_fast_len) shows a better performance.

rm -rf sdp/__pycache__ ./timing.py -timeout 1.0 -pmin 7 -pmax 20 pyfftw challenger challenger_real_false > timing.csv rm -rf sdp/__pycache__

NimaSarajpoor added 3 commits January 7, 2026 23:32

modified oaconvolve

72f0fb2

update code logic

be0035e

minor clean ups

907e2e2

NimaSarajpoor commented Jan 8, 2026

View reviewed changes

NimaSarajpoor added 3 commits January 9, 2026 10:16

major changes to imporve readability

969f187

Add param blocksize

b366e35

add temp test for challenger

a3d2e44

NimaSarajpoor commented Jan 9, 2026

View reviewed changes

NimaSarajpoor requested a review from seanlaw January 9, 2026 20:39

NimaSarajpoor commented Jan 9, 2026

View reviewed changes

Comment thread sdp/challenger_sdp.py Outdated

seanlaw requested changes Jan 9, 2026

View reviewed changes

Comment thread sdp/challenger_sdp.py Outdated

Comment thread sdp/challenger_sdp.py Outdated

Comment thread sdp/challenger_sdp.py Outdated

Comment thread sdp/challenger_sdp.py

NimaSarajpoor commented Jan 10, 2026

View reviewed changes

Comment thread sdp/challenger_sdp.py Outdated

NimaSarajpoor added 2 commits January 9, 2026 23:53

add func for computing block size

22d233b

remove redundant code

d6eedfa

NimaSarajpoor commented Jan 11, 2026

View reviewed changes

Comment thread sdp/challenger_sdp.py

Comment thread sdp/challenger_sdp.py Outdated

Comment thread sdp/challenger_sdp.py Outdated

NimaSarajpoor added 7 commits January 11, 2026 14:50

added clearer functions

4e23694

minor change

01a0e7a

minor change to help with future refactoring

dddb708

minor change

41db845

Added reference for finding optimal block size

99e450b

fixed test

e8fa331

revise comment

f45f541

NimaSarajpoor commented Jan 12, 2026

View reviewed changes

Comment thread sdp/challenger_sdp.py Outdated

NimaSarajpoor added 2 commits January 13, 2026 13:33

removed overlap-add explanation. Created PR#36 instead

39e936c

renaming private functions to reflect valid convolution

f6fed15

NimaSarajpoor commented May 17, 2026

View reviewed changes

Merge branch 'main' into oaconvolve

ccbb651

NimaSarajpoor mentioned this pull request May 19, 2026

What is the optimal size of block in the overlap-add method? #37

Open

NimaSarajpoor added 3 commits May 18, 2026 22:58

address comments

1033584

add docstrings and comments

d548d0d

improved docstrings and comments

c78d67b

NimaSarajpoor commented May 20, 2026

View reviewed changes



		def test_oaconvolve_sdp_blocksize():
		from sdp.challenger_sdp import sliding_dot_product

Conversation

NimaSarajpoor commented Jan 8, 2026

Uh oh!

gitnotebooks Bot commented Jan 8, 2026

Uh oh!

NimaSarajpoor commented Jan 8, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

seanlaw commented Jan 9, 2026

Uh oh!

NimaSarajpoor commented Jan 9, 2026

Uh oh!

NimaSarajpoor Jan 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

NimaSarajpoor commented Jan 9, 2026

Uh oh!

Uh oh!

seanlaw left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

NimaSarajpoor May 17, 2026

Choose a reason for hiding this comment

Uh oh!

NimaSarajpoor May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

NimaSarajpoor May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

seanlaw left a comment •

edited

Loading

NimaSarajpoor May 20, 2026 •

edited

Loading

NimaSarajpoor May 20, 2026 •

edited

Loading