gh-150621: avoid quadratic bytes slicing in `asyncio.protocols._feed_data_to_buffered_proto` by deadlovelll · Pull Request #150622 · python/cpython

deadlovelll · 2026-05-30T17:43:07Z

Track the offset into data instead of reslicing. Drops total work from O(N^2) to O(N).

See gh-150621 for the benchmark table.

To get the results of new version run: ./python.exe buffer_memory_view_bench.py --variant new -o new.json
To get the results of old one: ./python.exe buffer_memory_view_bench.py --variant old -o old.json
Get the compare table: ./python.exe -m pyperf compare_to old.json new.json --table

Benchmark script

import pyperf
from asyncio import BufferedProtocol


def feed_old(proto, data):
    data_len = len(data)
    while data_len:
        buf = proto.get_buffer(data_len)
        buf_len = len(buf)
        if not buf_len:
            raise RuntimeError('get_buffer() returned an empty buffer')

        if buf_len >= data_len:
            buf[:data_len] = data
            proto.buffer_updated(data_len)
            return
        else:
            buf[:buf_len] = data[:buf_len]
            proto.buffer_updated(buf_len)
            data = data[buf_len:]
            data_len = len(data)


def feed_new(proto, data):
    data_len = len(data)
    start = 0
    while data_len:
        buf = proto.get_buffer(data_len)
        buf_len = len(buf)
        if not buf_len:
            raise RuntimeError('get_buffer() returned an empty buffer')

        if buf_len >= data_len:
            buf[:data_len] = data[start:start + data_len] if start else data
            proto.buffer_updated(data_len)
            return
        else:
            buf[:buf_len] = data[start:start + buf_len]
            proto.buffer_updated(buf_len)
            start += buf_len
            data_len -= buf_len


class Proto(BufferedProtocol):
    def __init__(self, buf_size):
        self._buf = bytearray(buf_size)

    def get_buffer(self, sizehint):
        return self._buf

    def buffer_updated(self, nbytes):
        pass


def bench(loops, feed, data, proto):
    range_it = range(loops)
    t0 = pyperf.perf_counter()
    for _ in range_it:
        feed(proto, data)
    return pyperf.perf_counter() - t0


def add_cmdline_args(cmd, args):
    cmd.extend(("--variant", args.variant))


runner = pyperf.Runner(add_cmdline_args=add_cmdline_args)
runner.argparser.add_argument(
    "--variant", choices=["old", "new"], required=True,
    help="impl to bench",
)
args = runner.parse_args()
feed = feed_old if args.variant == "old" else feed_new

scenarios = [
    (64 * 1024,        4096),
    (256 * 1024,       4096),
    (1024 * 1024,      4096),
    (4 * 1024 * 1024,  4096),
    (1024 * 1024,      65536),

    (100,              4096),    
    (1024,             4096),    
    (4096,             4096),    
    (32 * 1024,        65536),
]

for ds, bs in scenarios:
    proto = Proto(bs)
    data = b"x" * ds
    runner.bench_time_func(f"data={ds:>8} buf={bs:>5}", bench, feed, data, proto)

picnixz · 2026-05-30T17:48:09Z

FTR, the titles of your PRs are always missing a colon after the gh-XXXXXX issue number (I've added it)

deadlovelll · 2026-05-30T17:49:35Z

FTR, the titles of your PRs are always missing a colon after the gh-XXXXXX issue number (I've added it)

Thank you! I'll take this into account for the future

…buffered_proto

picnixz

Can you update your benchmarks?
If there is a way to see the improvement with a visible example (like, where is this function being called in a real-world application, or if there is some public endpoint we can refer to), please add a NEWS entry mentioning this improvement.
Can you add some tests that would exercise this change if none exists?
Please don't force-push in the future (here it's ok because I didn't review it yet, but forcepushing discards commit history and incremental reviews are hard).

deadlovelll · 2026-05-30T20:15:08Z

Can you update your benchmarks?

If there is a way to see the improvement with a visible example (like, where is this function being called in a real-world application, or if there is some public endpoint we can refer to), please add a NEWS entry mentioning this improvement.

Can you add some tests that would exercise this change if none exists?

Please don't force-push in the future (here it's ok because I didn't review it yet, but forcepushing discards commit history and incremental reviews are hard).

Updated benchmarks. Removed memoryview mentions from issue and PR description.
Added NEWS entry mentioning the call site
Added FeedDataToBufferedProtoTests inLib/test/test_asyncio/test_protocols.py with two tests:

test_large_multi_iteration cacthes offset drift
test_memoryview_input verifies that offset-tracking works with memoryview input too
Got it about force-pushing, I will avoid it in future contribs

deadlovelll requested review from 1st1, asvetlov, kumaraditya303 and willingc as code owners May 30, 2026 17:43

bedevere-app Bot mentioned this pull request May 30, 2026

asyncio.protocols._feed_data_to_buffered_proto: avoid O(N^2) bytes slicing #150621

Open

bedevere-app Bot added the awaiting review label May 30, 2026

picnixz changed the title ~~gh-150621 asyncio.protocols._feed_data_to_buffered_proto: avoid O(N^2) bytes slicing~~ gh-150621: avoid quadratic bytes slicing in asyncio.protocols._feed_data_to_buffered_proto May 30, 2026

picnixz changed the title ~~gh-150621: avoid quadratic bytes slicing in asyncio.protocols._feed_data_to_buffered_proto~~ gh-150621: avoid quadratic bytes slicing in asyncio.protocols._feed_data_to_buffered_proto May 30, 2026

pythongh-150621: Avoid O(N^2) bytes slicing in asyncio _feed_data_to_…

cfda592

…buffered_proto

deadlovelll force-pushed the gh-150621-feed_buff_mv branch from 7e04c75 to cfda592 Compare May 30, 2026 18:43

picnixz reviewed May 30, 2026

View reviewed changes

deadlovelll added 2 commits May 30, 2026 22:35

add NEWS entry

9e684bb

Add tests for _feed_data_to_buffered_proto

05989d3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

gh-150621: avoid quadratic bytes slicing in `asyncio.protocols._feed_data_to_buffered_proto`#150622

gh-150621: avoid quadratic bytes slicing in `asyncio.protocols._feed_data_to_buffered_proto`#150622
deadlovelll wants to merge 3 commits into
python:mainfrom
deadlovelll:gh-150621-feed_buff_mv

deadlovelll commented May 30, 2026 •

edited

Loading

Uh oh!

picnixz commented May 30, 2026

Uh oh!

deadlovelll commented May 30, 2026

Uh oh!

picnixz left a comment •

edited

Loading

Uh oh!

deadlovelll commented May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

deadlovelll commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

picnixz commented May 30, 2026

Uh oh!

deadlovelll commented May 30, 2026

Uh oh!

picnixz left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

deadlovelll commented May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

deadlovelll commented May 30, 2026 •

edited

Loading

picnixz left a comment •

edited

Loading