Improve the performance when using enumeration by aplopez · Pull Request #8395 · SSSD/sssd

aplopez · 2026-01-21T14:36:29Z

This PR includes:

Removal of an unused function.
Stop logging a possibly extremely long filter.
Fixes a wrong condition invalidating an optimization.
Adds a test case for an existing test.

Enumeration, specially when there are 15,000+ users, is slow. This fix helps, but it doesn't work miracles.
In my test environment, the enumeration went from 8 minutes to about 1.

It is important to know that, with such an amount of users, many operations time out. It is necessary to increment the timeout in[nss] and for the domain, but also set large values for ldap_enumeration_refresh_timeout and ldap_search_timeout in the domain. I used these values to avoid any timeout (YMMV):

[domain/ldap.test]
ldap_enumeration_refresh_timeout = 30000
ldap_search_timeout = 6000
timeout = 6000
...

[nss]
timeout = 6000
...

gemini-code-assist

Code Review

This pull request effectively improves performance by optimizing logging, removing an unused function, and correcting a condition related to enumeration. The changes are well-aligned with the stated goals of enhancing enumeration performance, especially for large user bases. The addition of a new test case for the general enumeration scenario ensures that the modified logic is adequately covered.

src/db/sysdb_search.c

alexey-tikhonov · 2026-01-22T12:11:25Z

Mistype in the commit message: "We must look into de TS cache"

aplopez · 2026-01-22T15:48:24Z

Mistype in the commit message: "We must look into de TS cache"

Fixed.

alexey-tikhonov · 2026-01-23T09:57:49Z

I think fix is correct in the sense it fixes a bug.

But I think logic of sysdb_enumpwent_filter() can and should be improved in general to avoid a case when dn_filter expands to entire db.

In particular, if addtl_filter isn't set, then sysdb_search_ts_users(enum_filter(NULL)) is expected to return entire db, right? And using this as additional filter results in the same as '*' but extremely slow.
Or do I miss something?

src/db/sysdb_search.c

alexey-tikhonov · 2026-02-24T18:15:47Z

Note: Covsan is green so far.

alexey-tikhonov · 2026-02-25T09:01:40Z

Hm,
F44:

FAILED tests/test_infopipe.py::test_infopipe__list_by_name (ldap) - AssertionError: ListByName('user-*', 0) is missing element 10002
assert '/org/freedesktop/sssd/infopipe/Users/test/10002' in ['/org/freedesktop/sssd/infopipe/Users/test/10001', '/org/freedesktop/sssd/infopipe/Users/test/10003']

Looks relevant, but why f44 only... race condition?

aplopez · 2026-02-25T18:17:41Z

Looks relevant, but why f44 only... race condition?

I reran the tests and a different test failed. 😮‍💨
Locally, on my PC (Fedora 43, though) the test passes every time.

aplopez · 2026-02-26T14:02:50Z

And now all the tests passed. There is some instability in F44, but not related to this PR.

alexey-tikhonov · 2026-02-26T14:59:13Z

And now all the tests passed. There is some instability in F44, but not related to this PR.

It is very suspicious that it was test_infopipe__list_by_name that I didn't see failing before.
Can there be a race condition in the test itself that is triggered by slow runner?

aplopez · 2026-02-26T16:22:53Z

It is very suspicious that it was test_infopipe__list_by_name that I didn't see failing before. Can there be a race condition in the test itself that is triggered by slow runner?

I thought the same until I noticed this test failed once and never again. The second time a completely different test failed. The third time, the latest, none.

src/tests/cmocka/test_sysdb_views.c

sumit-bose · 2026-03-19T08:03:48Z

Hi,

I think I have no further comments to the code and it looks like the enumeration performance also improved. Did you run some enumeration test with and without the latest version patches as well. If yes, can you share the result here so that we have some value for future reference, if needed? If possible, results with sssd-2.7.3 would be nice as well.

bye,
Sumit

aplopez · 2026-03-19T08:53:55Z

Did you run some enumeration test with and without the latest version patches as well. If yes, can you share the result here so that we have some value for future reference, if needed?

The results are instantaneous when the cache is populated, that is, when the task Enumeration [id] has finished.
While the task is running, the results are variable, but still under 10 seconds.

If possible, results with sssd-2.7.3 would be nice as well.

I don't have such an environment and since I'm running the tests on my own PC, switching to 2.7.3 is not a simple task.

Using a VM with 2.7.3 might be a posibility, but I don't know to which extent those results are comparable. If you think this is a viable way, let me know and I will give it a try.

src/db/sysdb_search.c

src/db/sysdb_init.c

src/db/sysdb_search.c

alexey-tikhonov · 2026-03-19T19:20:50Z

The "name" attribute weas not being added to the YS cache, even thought

weas - > were
YS -> TS
thought -> though

alexey-tikhonov · 2026-03-19T19:21:33Z

handle this case correctly but why waisting time calling them?

waisting -> wasting

Create the filter to retrieve only the requested entries. Do not create a new filter and search for matches if there is no results from the previous search. The called functions handle this case correctly but why wasting time calling them?

Function cache_req_user_by_filter_lookup() will set or not the recent filter depending on whether data->name.attr is set or not. As mentioned in the comment, it should be done base on whether the refernced attribute is name or not.

The message said that sysdb_enumpwent() had failed, but it was actually sysdb_enumpwent_filter() which failed.

aplopez · 2026-03-20T09:16:29Z

Fixed all the typos.

alexey-tikhonov · 2026-03-20T10:17:44Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces several changes to improve user enumeration performance, particularly in environments with a large number of users. The key changes include removing an unused function, correcting a condition to enable an optimization for name-based filtering, preventing the logging of potentially very long filter strings, and updating tests accordingly. The changes appear sound and should deliver the described performance benefits. I have one suggestion to improve the accuracy of a debug log message.

src/db/sysdb_search.c

alexey-tikhonov · 2026-03-20T11:20:15Z

Note: Covscan is green (barring mistype in the latest update).

The "name" attribute was not being added to the TS cache, even though that it is part of the DN (ldb doesn't enforce it). This made the if-block in sysdb_enumpwent_filter() rather useless. In addition, once this if-block is executed, the fuction leaves without further processing.

Although ts_res.count is set to 0 when sysdb_search_ts_users() return ERR_NO_TS, before using it we make an extra check to verify that the returned code is EOK.

alexey-tikhonov · 2026-03-20T11:38:57Z

Thank you, ACK.

gemini-code-assist bot reviewed Jan 21, 2026

View reviewed changes

src/db/sysdb_search.c Show resolved Hide resolved

alexey-tikhonov self-assigned this Jan 21, 2026

alexey-tikhonov self-requested a review January 21, 2026 14:44

aplopez force-pushed the enumerate branch from 955232a to f98eca5 Compare January 21, 2026 18:05

alexey-tikhonov reviewed Jan 22, 2026

View reviewed changes

src/db/sysdb_search.c Show resolved Hide resolved

aplopez added the backport-to-sssd-2-9 label Jan 22, 2026

alexey-tikhonov reviewed Jan 22, 2026

View reviewed changes

src/db/sysdb_search.c Outdated Show resolved Hide resolved

alexey-tikhonov added the Bugzilla label Jan 22, 2026

alexey-tikhonov requested a review from sumit-bose January 22, 2026 13:45

alexey-tikhonov assigned sumit-bose Jan 22, 2026

aplopez force-pushed the enumerate branch from f98eca5 to c78e8f6 Compare January 22, 2026 15:47

aplopez force-pushed the enumerate branch from c78e8f6 to 631e4be Compare January 23, 2026 10:05

alexey-tikhonov reviewed Feb 12, 2026

View reviewed changes

src/db/sysdb_search.c Show resolved Hide resolved

aplopez marked this pull request as ready for review February 24, 2026 13:19

aplopez added the Waiting for review label Feb 24, 2026

aplopez force-pushed the enumerate branch from b3819b1 to ed557b2 Compare February 24, 2026 16:54

alexey-tikhonov added the coverity Trigger a coverity scan label Feb 24, 2026

alexey-tikhonov removed the coverity Trigger a coverity scan label Feb 24, 2026

sumit-bose reviewed Mar 6, 2026

View reviewed changes

src/tests/cmocka/test_sysdb_views.c Outdated Show resolved Hide resolved

alexey-tikhonov linked an issue Mar 7, 2026 that may be closed by this pull request

NSS enumerated passwd/group truncated output and performance regression since >=2.8.0 #6951

Open

alexey-tikhonov mentioned this pull request Mar 7, 2026

NSS enumerated passwd/group truncated output and performance regression since >=2.8.0 #6951

Open

aplopez force-pushed the enumerate branch from 2e12f46 to 2c0c9b2 Compare March 18, 2026 08:16

alexey-tikhonov reviewed Mar 19, 2026

View reviewed changes

src/db/sysdb_search.c Outdated Show resolved Hide resolved

alexey-tikhonov reviewed Mar 19, 2026

View reviewed changes

src/db/sysdb_init.c Show resolved Hide resolved

alexey-tikhonov reviewed Mar 19, 2026

View reviewed changes

src/db/sysdb_search.c Outdated Show resolved Hide resolved

alexey-tikhonov reviewed Mar 19, 2026

View reviewed changes

src/db/sysdb_search.c Show resolved Hide resolved

alexey-tikhonov added Changes requested and removed Waiting for review labels Mar 19, 2026

aplopez added 3 commits March 20, 2026 10:15

NSS: Some optimizations.

650f33c

Create the filter to retrieve only the requested entries. Do not create a new filter and search for matches if there is no results from the previous search. The called functions handle this case correctly but why wasting time calling them?

NSS: Be coherent when using a lastUpdate filter

9de7cdc

Function cache_req_user_by_filter_lookup() will set or not the recent filter depending on whether data->name.attr is set or not. As mentioned in the comment, it should be done base on whether the refernced attribute is name or not.

NSS: Fix the logged function name

d92a1c1

The message said that sysdb_enumpwent() had failed, but it was actually sysdb_enumpwent_filter() which failed.

aplopez force-pushed the enumerate branch from 2c0c9b2 to 68cfc33 Compare March 20, 2026 09:43

aplopez added Waiting for review and removed Changes requested labels Mar 20, 2026

alexey-tikhonov added the coverity Trigger a coverity scan label Mar 20, 2026

gemini-code-assist bot reviewed Mar 20, 2026

View reviewed changes

src/db/sysdb_search.c Outdated Show resolved Hide resolved

aplopez force-pushed the enumerate branch from 68cfc33 to 9927463 Compare March 20, 2026 10:37

alexey-tikhonov removed the coverity Trigger a coverity scan label Mar 20, 2026

aplopez added 2 commits March 20, 2026 12:29

NSS: Better handle ERR_NO_TS in sysdb_enumpwent_filter()

82bac13

Although ts_res.count is set to 0 when sysdb_search_ts_users() return ERR_NO_TS, before using it we make an extra check to verify that the returned code is EOK.

aplopez force-pushed the enumerate branch from 9927463 to 82bac13 Compare March 20, 2026 11:29

alexey-tikhonov approved these changes Mar 20, 2026

View reviewed changes

alexey-tikhonov requested a review from sumit-bose March 20, 2026 11:38

Conversation

aplopez commented Jan 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexey-tikhonov commented Jan 22, 2026

Uh oh!

aplopez commented Jan 22, 2026

Uh oh!

alexey-tikhonov commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

alexey-tikhonov commented Feb 24, 2026

Uh oh!

alexey-tikhonov commented Feb 25, 2026

Uh oh!

aplopez commented Feb 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

aplopez commented Feb 26, 2026

Uh oh!

alexey-tikhonov commented Feb 26, 2026

Uh oh!

aplopez commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

sumit-bose commented Mar 19, 2026

Uh oh!

aplopez commented Mar 19, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

alexey-tikhonov commented Mar 19, 2026

Uh oh!

alexey-tikhonov commented Mar 19, 2026

Uh oh!

aplopez commented Mar 20, 2026

Uh oh!

alexey-tikhonov commented Mar 20, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

alexey-tikhonov commented Mar 20, 2026

Uh oh!

alexey-tikhonov commented Mar 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

aplopez commented Jan 21, 2026 •

edited

Loading

alexey-tikhonov commented Jan 23, 2026 •

edited

Loading

aplopez commented Feb 25, 2026 •

edited

Loading

aplopez commented Feb 26, 2026 •

edited

Loading