Issue 7462: Implement fastsearch algorithm for rfind/rindex (original) (raw)

Created on 2009-12-08 22:46 by flox, last changed 2022-04-11 14:56 by admin. This issue is now closed.

Files
File name	Uploaded	Description	Edit
issue7462_string_tests.diff	flox,2009-12-20 22:55	Patch, apply to trunk
issue7462_fastsearch_v3.diff	flox,2009-12-21 13:30	Patch, apply to trunk
stringbench_fixes_and_additions_v3.diff	flox,2009-12-21 21:17	Patch for sandbox/trunk/stringbench/
bench_rfind_algorithms_v3.diff	flox,2009-12-21 21:29	Benchmark Results (including rpartition and rsplit)

Messages (25)
msg96157 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-08 22:46
While looking at I stopped on: /* XXX - create reversefastsearch helper! */ Here is the patch which is just the translation of the same algorithm already implemented for "find/index". http://effbot.org/zone/stringlib.htm Note: it supersedes patch for 7458.
msg96163 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-09 00:24
(removed comment which should not be copied)
msg96724 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-20 22:55
Additional test cases for rfind.
msg96725 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-20 23:08
Bench results show the benefit. And attached patch for stringbench tool.
msg96726 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-20 23:26
There's a typo in the patch for stringbench. Updated patch (with rindex tests, too).
msg96727 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-20 23:55
Updated benchmarks. str unicode (ms) (ms) ========== late match, 100 characters s="ABC"33; ("E"+s+("D"+s)500).rfind("E"+s) 32.89 15.65 rfind (classic) 32.81 15.63 rindex (classic) 11.77 13.27 rfind (fastsearch) 11.78 13.40 rindex (fastsearch) ========== late match, two characters 4.34 2.36 ("C"+"AB"300).rfind("CA") (classic) 4.44 2.36 ("C"+"AB"300).rindex("CA") (classic) 2.10 2.31 ("C"+"AB"300).rfind("CA") (fastsearch) 2.10 2.32 ("C"+"AB"300).rindex("CA") (fastsearch) ========== no match, two characters 14.12 13.46 ("AB"1000).rfind("BC") (classic) 7.67 8.26 ("AB"1000).rfind("BC") (fastsearch)
msg96729 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 00:43
Need additional patch for rpartition and rsplit on string, unicode and bytearray objects.
msg96738 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 11:32
Updated patch with optimization for: * rfind * rindex * rsplit * rpartition And an optimization was implemented in r46398 but never used because "#define USE_FAST" was removed 2 hours before: r46366. ==> I changed this to activate optimization on "S.split" too. I propose to commit these changes before looking for further improvements. (I plan to refactor some functions and macro to Objects/stringlib)
msg96739 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 11:33
Actually, the USE_FLAG macro was removed in r46367 (not 46366). « needforspeed: remove remaining USE_FAST macros; if fastsearch was broken, someone would have noticed by now ;-) »
msg96743 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 12:36
I reupload the patch fixed (sorry).
msg96747 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-21 12:51
Some things: - is STRINGLIB_CMP still used? if not, it should be removed (use grep :-)) - please don't use "#if 1" - if USE_FAST isn't used anymore, it should be remove as well - did you check that rpartition, split and friends were sped up by the patch? Thanks for working on this!
msg96748 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 13:01
Actually I see different macros which do the same thing: I will consider reusing STRINGLIB_CMP to re-define PyUNICODE_MATCH and PySTRING_MATCH (or create aliases if they have the same signature). I can prepare a "all-in-one" patch which fixes all these things. But don't you think we should do this incrementally, i.e. commit the current patch before refactoring more code? I will remove the "#if 1 /* USE_FAST */".
msg96749 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-21 13:06
> Actually I see different macros which do the same thing: I will consider > reusing STRINGLIB_CMP to re-define PyUNICODE_MATCH and PySTRING_MATCH > (or create aliases if they have the same signature). STRINGLIB_CMP, as the name implies, should only be used by stringlib. (anything which doesn't start with Py* shouldn't be exported in the official include files) > But don't you think we should do this incrementally, i.e. commit the > current patch before refactoring more code? Well, if STRINGLIB_CMP isn't used anymore, removing it should be part of the current issue. There's no reason to leave dead code in the source tree. I agree that further cleanups can be part of another issue.
msg96750 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 13:30
Removing "slow" parts and unnused macros STRINGLIB_CMP and USE_FAST.
msg96757 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-21 16:12
I haven't reviewed the algorithm (I don't know if I will, honestly), but at least on the principle this looks good. Fredrik, do you want to take a look?
msg96771 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-21 21:29
New benchmarks (and patch for the benchmark tool). Best improvement is reached with such expression: s="ABC"33; (s+"E"+("D"+s)500).rfind(s+"E") (*100) String (classic): 93.14 ms String (fast): 8.78 ms Unicode (classic): 78.62 ms Unicode (fast): 9.42 ms
msg97039 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-30 16:38
Looking a bit more at the patch: + /* miss: check if previous character is part of pattern / + if (!(mask & (1 << (s[i-1] & 0x1F)))) From what I understand, this should be s[i-m]. Same here: + / skip: check if previous character is part of pattern */ + if (!(mask & (1 << (s[i-1] & 0x1F))))
msg97084 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-31 10:03
I checked the code, and it is the right thing: Example 1 (fastsearch): s, p = "ABCDEABCF", "BCD" s.rfind(p) # i = 6 is first candidate, but "BCF" != "BCD" # then s[i-1] = "A" is not part of pattern # so we skip 3 chars: next loop is i = 2 # etc... Example 2 (): s, p = "ABCDBCBCF", "BCB" s.rfind(p) # i = 6 is first candidate, but "BCF" != "BCB" # then s[i-m] = "D" is not part of pattern # so we skip 3 chars: next loop is i = 2.. and it fails! I've tested many examples to understand and verify the algorithm.
msg97087 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-31 12:25
> I checked the code, and it is the right thing: Indeed, you are right. My bad!
msg97103 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2009-12-31 16:50
I will take a last look at it and commit if I see nothing wrong.
msg97105 - (view)	Author: Raymond Hettinger (rhettinger) *	Date: 2009-12-31 17:51
Are there any simple, common cases that are made slower by this patch? IIRC, that was the reason this wasn't implemented previously.
msg97106 - (view)	Author: Florent Xicluna (flox) *	Date: 2009-12-31 17:57
The benchmark tests show significant improvements in most cases up to 10 times faster. And I found no test case which show performance regression (using the "stringbench" benchmark from GvR, and additional tests). Moreover, the very same algorithm is already implemented for find/index/partition.
msg97146 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2010-01-02 20:41
I've added a version number to stringbench and committed the changes in r77240.
msg97148 - (view)	Author: Antoine Pitrou (pitrou) *	Date: 2010-01-02 21:42
The main patch has been committed in r77241 (trunk) and r77246 (py3k). I've ommitted the tests you had added for . Thank you!
msg97200 - (view)	Author: Fredrik Lundh (effbot) *	Date: 2010-01-04 09:20
Thanks Florent! > Are there any simple, common cases that are made slower by this patch? The original fastsearch implementation has a couple of special cases to make sure it's faster than the original code in all cases. The reason it wasn't implemented for reverse search was more a question of developer time constraints; reverse search isn't nearly as common as forward search, and we had other low-hanging fruit to deal with. (btw, while it's great that someone finally got around to fix this, it wouldn't surprise me if replacing the KMP implementation in SRE with a fastsearch would save as many CPU cycles worldwide as this patch :)

History
Date	User	Action	Args
2022-04-11 14:56:55	admin	set	github: 51711
2010-01-04 09:26:26	ezio.melotti	set	nosy: + ezio.melotti
2010-01-04 09:20:20	effbot	set	messages: +
2010-01-02 21:42:27	pitrou	set	status: open -> closedresolution: fixedmessages: + stage: commit review -> resolved
2010-01-02 20:41:43	pitrou	set	messages: +
2009-12-31 17:57:57	flox	set	messages: +
2009-12-31 17:51:08	rhettinger	set	nosy: + rhettingermessages: +
2009-12-31 16:50:30	pitrou	set	priority: normalversions: - Python 2.6, Python 3.1messages: + assignee: pitroustage: patch review -> commit review
2009-12-31 12:25:14	pitrou	set	messages: +
2009-12-31 10:03:44	flox	set	messages: +
2009-12-30 16:38:58	pitrou	set	messages: +
2009-12-21 21:29:27	flox	set	files: - bench_rfind_algorithms_v2.diff
2009-12-21 21:29:17	flox	set	files: + bench_rfind_algorithms_v3.diffmessages: +
2009-12-21 21:17:17	flox	set	files: + stringbench_fixes_and_additions_v3.diff
2009-12-21 21:16:59	flox	set	files: - stringbench_fixes_and_additions_v2.diff
2009-12-21 20:55:08	flox	set	files: + stringbench_fixes_and_additions_v2.diff
2009-12-21 18:53:15	flox	set	files: - stringbench_fixes_and_additions.diff
2009-12-21 18:38:18	flox	set	files: + stringbench_fixes_and_additions.diff
2009-12-21 18:37:51	flox	set	files: - stringbench_rfind_rindex.diff
2009-12-21 16:12:47	pitrou	set	nosy: + effbotmessages: +
2009-12-21 13:31:06	flox	set	files: - issue7462_fastsearch_v2.diff
2009-12-21 13:30:58	flox	set	files: + issue7462_fastsearch_v3.diffmessages: +
2009-12-21 13:06:22	pitrou	set	messages: +
2009-12-21 13:01:45	flox	set	messages: +
2009-12-21 12:51:44	pitrou	set	nosy: + pitroumessages: +
2009-12-21 12:36:34	flox	set	files: + issue7462_fastsearch_v2.diffmessages: +
2009-12-21 12:35:54	flox	set	files: - fastsearch_rfind_v2.patch
2009-12-21 11:49:50	flox	set	files: - fastsearch_rfind.patch
2009-12-21 11:33:48	flox	set	messages: +
2009-12-21 11:32:03	flox	set	files: + fastsearch_rfind_v2.patchmessages: + stage: needs patch -> patch review
2009-12-21 00:43:47	flox	set	messages: + stage: patch review -> needs patch
2009-12-20 23:55:43	flox	set	files: - bench_rfind_algorithms.diff
2009-12-20 23:55:08	flox	set	files: + bench_rfind_algorithms_v2.diffmessages: + stage: patch review
2009-12-20 23:26:18	flox	set	files: - stringbench_rfind.diff
2009-12-20 23:26:06	flox	set	files: + stringbench_rfind_rindex.diffmessages: +
2009-12-20 23:08:35	flox	set	files: + bench_rfind_algorithms.diff
2009-12-20 23:08:14	flox	set	files: + stringbench_rfind.diffmessages: +
2009-12-20 22:55:34	flox	set	files: + issue7462_string_tests.diffmessages: +
2009-12-09 00:24:46	flox	set	files: - fastsearch_rfind.patch
2009-12-09 00:24:20	flox	set	files: + fastsearch_rfind.patchmessages: +
2009-12-08 22:46:43	flox	create