feat: extend indexer with prefix matching and db persistence #1422

Open
thunguo wants to merge 1 commit into apache:develop from thunguo:feature/extend-indexer-prefix-match

@thunguo thunguo commented Mar 7, 2026

Please provide a description of this PR:
Extend indexer with prefix matching and db persistence.
Issue #1424

To help us figure out who should review this PR, please put an X in all the areas that this PR affects.

  • Docs
  • Installation
  • User Experience
  • Dubboctl
  • Console
  • Core Component

Please check any characteristics that apply to this pull request.

sonarqubecloud bot commented Mar 7, 2026

Copilot AI left a comment

Pull request overview

Extends the core resource indexing/query mechanism to support operator-based index conditions (including prefix matching) and introduces DB persistence for indexes to support consistent querying in multi-replica deployments (Issue #1424).

Changes:

  • Refactors ListByIndexes / PageListByIndexes APIs from map[string]string to []index.IndexCondition (adds Equals and HasPrefix operators).
  • Adds in-memory prefix matching via a Radix tree structure and introduces DB-backed persisted index storage via a new resource_indices table.
  • Updates manager/helpers, discovery/engine subscribers, and console services to use the new index condition API; adds tests for prefix matching and DB persistence.
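
As a rough illustration of the condition-based API summarized in the list above, the sketch below models an index condition with `Equals` and `HasPrefix` operators and evaluates a list of conditions against one resource's indexed values. The field names (`IndexName`, `Operator`, `Value`), the string-typed operator constants, the `matches`/`matchesAll` helpers, and the AND semantics across conditions are assumptions for illustration; the actual definitions in pkg/core/store/index/condition.go may differ.

```go
package main

import (
	"fmt"
	"strings"
)

// IndexOperator names how a condition compares against an indexed value.
// Hypothetical representation; the PR may use a different underlying type.
type IndexOperator string

const (
	Equals    IndexOperator = "Equals"
	HasPrefix IndexOperator = "HasPrefix"
)

// IndexCondition is one clause of an index query (field names are assumed).
type IndexCondition struct {
	IndexName string
	Operator  IndexOperator
	Value     string
}

// matches reports whether a single indexed value satisfies the condition.
func matches(c IndexCondition, indexed string) bool {
	switch c.Operator {
	case Equals:
		return indexed == c.Value
	case HasPrefix:
		return strings.HasPrefix(indexed, c.Value)
	default:
		return false
	}
}

// matchesAll applies AND semantics (an assumption): every condition must
// match the value stored under its index name.
func matchesAll(conds []IndexCondition, values map[string]string) bool {
	for _, c := range conds {
		v, ok := values[c.IndexName]
		if !ok || !matches(c, v) {
			return false
		}
	}
	return true
}

func main() {
	conds := []IndexCondition{
		{IndexName: "mesh", Operator: Equals, Value: "default"},
		{IndexName: "ip", Operator: HasPrefix, Value: "192.168."},
	}
	values := map[string]string{"mesh": "default", "ip": "192.168.1.10"}
	fmt.Println(matchesAll(conds, values))
}
```

Compared with the earlier map[string]string form, a slice of conditions can carry an operator per clause, which is what makes room for prefix matching.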

Reviewed changes

Copilot reviewed 21 out of 22 changed files in this pull request and generated 12 comments.

| File | Description |
| --- | --- |
| pkg/store/memory/store.go | Adds Radix-tree-backed prefix matching and updates index query APIs to use IndexCondition. |
| pkg/store/memory/store_test.go | Updates existing tests for new index API and adds HasPrefix test coverage. |
| pkg/store/dbcommon/model.go | Introduces ResourceIndexModel mapping persisted index entries (resource_indices). |
| pkg/store/dbcommon/index.go | Adds Index.AddEntry to support rebuilding in-memory indices from persisted entries. |
| pkg/store/dbcommon/gorm_store.go | Migrates resource_indices, persists index entries on CRUD/replace, adds DB prefix query path, rebuilds indices from persisted table. |
| pkg/store/dbcommon/gorm_store_test.go | Updates tests for new index API and adds tests for persistence, prefix matching, and rebuild. |
| pkg/core/store/store.go | Updates ResourceStore interface to accept []IndexCondition. |
| pkg/core/store/index/condition.go | Adds IndexCondition and IndexOperator (Equals, HasPrefix). |
| pkg/core/manager/manager_helper.go | Updates helper methods to accept []IndexCondition; removes unused fuzzy-search helper. |
| pkg/core/manager/manager.go | Updates ReadOnlyResourceManager/implementation signatures for condition-based index queries; removes unimplemented fuzzy search method. |
| pkg/core/engine/subscriber/runtime_instance.go | Updates instance lookups to use IndexCondition with Equals. |
| pkg/core/discovery/subscriber/rpc_instance.go | Updates runtime instance lookup to use condition-based indexing. |
| pkg/core/discovery/subscriber/nacos_service.go | Updates multi-index lookups to use []IndexCondition with Equals. |
| pkg/core/discovery/subscriber/instance.go | Updates instance listing by mesh/app to use []IndexCondition. |
| pkg/console/service/tag_rule.go | Updates paging by mesh index to use []IndexCondition. |
| pkg/console/service/service.go | Updates service/provider/consumer queries to use condition-based indexing. |
| pkg/console/service/instance.go | Updates instance search queries to use []IndexCondition. |
| pkg/console/service/configurator_rule.go | Updates paging by mesh index to use []IndexCondition. |
| pkg/console/service/condition_rule.go | Updates paging by mesh index to use []IndexCondition. |
| pkg/console/service/application.go | Updates multiple application-related list/page queries to use condition-based indexing. |
| go.mod | Updates dependencies for Radix tree and other module tidy changes. |
| go.sum | Updates module checksums to match go.mod dependency changes. |

func (rs *resourceStore) Delete(obj interface{}) error {
r, ok := obj.(coremodel.Resource)
if ok {
rs.removeFromTrees(r)

Copilot AI Mar 8, 2026

Delete removes entries from the prefix trees using the object passed in. If callers delete using a minimal object (or a tombstone) that doesn't contain the original indexed fields, removeFromTrees won't compute the same index values and stale RadixTree entries will remain. To make deletes reliable, fetch the existing stored resource by key (similar to Update) and remove based on that stored object before deleting.

Suggested change
- rs.removeFromTrees(r)
+ // Fetch the existing stored resource by key to reliably remove from prefix trees.
+ if existingObj, exists, err := rs.storeProxy.GetByKey(r.ResourceKey()); err == nil && exists {
+ 	if existingRes, ok := existingObj.(coremodel.Resource); ok {
+ 		rs.removeFromTrees(existingRes)
+ 	}
+ }

Copilot uses AI. Check for mistakes.
if _, exists := rs.prefixTrees[indexName]; !exists {
rs.prefixTrees[indexName] = radix.New()
}
}

Copilot AI Mar 8, 2026

AddIndexers creates new RadixTrees for newly added indexers, but it never backfills existing stored resources into those new trees. After adding an indexer to a non-empty store, HasPrefix queries on that index will return no results until resources are re-added/updated. Consider iterating over existing items (e.g., rs.storeProxy.List()) and populating the new tree(s) immediately.

Suggested change
}
}
// Backfill newly added radix trees with existing items in the store so that
// prefix-based queries on the new indexes work immediately.
existingItems := rs.storeProxy.List()
for indexName, indexFunc := range newIndexers {
tree, exists := rs.prefixTrees[indexName]
if !exists {
continue
}
for _, obj := range existingItems {
indexedValues, err := indexFunc(obj)
if err != nil {
return err
}
for _, v := range indexedValues {
if v == "" {
continue
}
// We only need the key in the radix tree for prefix lookups, so the value
// can be a trivial placeholder.
tree.Insert(v, struct{}{})
}
}
}

// Delete old entries if updating
if oldResource != nil {
if err := db.Where("resource_key = ?", oldResource.ResourceKey()).Delete(&ResourceIndexModel{}).Error; err != nil {

Copilot AI Mar 8, 2026

When updating, old index entries are deleted with WHERE resource_key = ? only. Since ResourceKey() is just mesh/name (not globally unique across resource kinds), this can delete index rows belonging to other kinds that share the same key. Include resource_kind = ? (and ideally also index_name if you want narrower deletes) in the delete filter.

Suggested change
- if err := db.Where("resource_key = ?", oldResource.ResourceKey()).Delete(&ResourceIndexModel{}).Error; err != nil {
+ if err := db.Where("resource_key = ? AND resource_kind = ?", oldResource.ResourceKey(), gs.kind.ToString()).Delete(&ResourceIndexModel{}).Error; err != nil {

Comment on lines +657 to +661
// deleteIndexEntries removes all index entries for a resource key
func (gs *GormStore) deleteIndexEntries(resourceKey string) error {
db := gs.pool.GetDB()
return db.Where("resource_key = ?", resourceKey).Delete(&ResourceIndexModel{}).Error
}

Copilot AI Mar 8, 2026

deleteIndexEntries deletes by resource_key only. Because resource keys are not unique across kinds, this can remove index entries for other resource kinds. Filter by both resource_kind and resource_key to avoid cross-kind data loss.
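
To make the cross-kind collision concrete, here is a small in-memory sketch (not the GORM code itself) that models `resource_indices` rows as a slice and compares a delete scoped only by resource key with one scoped by kind and key. The `indexRow` type, both helper functions, and the kind names "Instance" and "Application" are hypothetical and exist only for this illustration.

```go
package main

import "fmt"

// indexRow is a simplified stand-in for one row of resource_indices.
type indexRow struct {
	ResourceKind string
	ResourceKey  string
	IndexName    string
	IndexValue   string
}

// deleteByKey mimics "WHERE resource_key = ?": it drops every row with the
// key, regardless of which resource kind owns it.
func deleteByKey(rows []indexRow, key string) []indexRow {
	kept := make([]indexRow, 0, len(rows))
	for _, r := range rows {
		if r.ResourceKey != key {
			kept = append(kept, r)
		}
	}
	return kept
}

// deleteByKindAndKey mimics "WHERE resource_kind = ? AND resource_key = ?":
// rows of other kinds that share the key are left untouched.
func deleteByKindAndKey(rows []indexRow, kind, key string) []indexRow {
	kept := make([]indexRow, 0, len(rows))
	for _, r := range rows {
		if r.ResourceKind == kind && r.ResourceKey == key {
			continue
		}
		kept = append(kept, r)
	}
	return kept
}

func main() {
	rows := []indexRow{
		{"Instance", "default/demo", "mesh", "default"},
		{"Application", "default/demo", "mesh", "default"},
	}
	fmt.Println(len(deleteByKey(rows, "default/demo")))                    // both kinds lose their rows
	fmt.Println(len(deleteByKindAndKey(rows, "Instance", "default/demo"))) // the Application row survives
}
```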

Comment on lines +667 to +669
err := db.Where("resource_kind = ? AND index_name = ? AND index_value LIKE ?",
gs.kind.ToString(), indexName, prefix+"%").
Find(&entries).Error

Copilot AI Mar 8, 2026

Prefix matching uses index_value LIKE prefix + '%'. If prefix can contain SQL wildcard characters (% / _), the query will match more than a literal prefix. If the intent is literal prefix semantics, escape wildcards in prefix (and add an ESCAPE clause) before building the LIKE pattern.
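
One possible way to get literal prefix semantics is sketched below: escape `\`, `%`, and `_` in the user-supplied prefix before appending the trailing wildcard. The helper name `escapeLikePrefix` and the choice of backslash as the escape character are assumptions; the resulting pattern would be bound to a query with an explicit ESCAPE clause, and how the escape character must be quoted in that clause differs between databases.

```go
package main

import (
	"fmt"
	"strings"
)

// escapeLikePrefix escapes the LIKE metacharacters in prefix and appends the
// trailing "%" wildcard. strings.Replacer works in a single left-to-right
// pass and never re-scans its own output, so backslashes added for "%" and
// "_" are not escaped a second time.
func escapeLikePrefix(prefix string) string {
	r := strings.NewReplacer(`\`, `\\`, `%`, `\%`, `_`, `\_`)
	return r.Replace(prefix) + "%"
}

func main() {
	// The literal "_" and "%" in the input are escaped; only the final "%"
	// acts as a wildcard.
	fmt.Println(escapeLikePrefix("10.0_1%"))
}
```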

Comment on lines +351 to +357
tree.WalkPrefix(prefix, func(k string, v interface{}) bool {
// Key format: "indexValue/resourceKey"
// Extract the resourceKey part (after the last "/")
idx := strings.LastIndex(k, "/")
if idx >= 0 && idx < len(k)-1 {
keys = append(keys, k[idx+1:])
}

Copilot AI Mar 8, 2026

getKeysByPrefix extracts the resource key using strings.LastIndex(k, "/"), which will truncate keys like mesh/name to just name. Since GetByKey expects the full resource key, HasPrefix queries can return keys that will never resolve to stored resources. The tree should store the full resource key without needing to parse it back out of the string key.
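
The truncation can be reproduced without the radix tree. The sketch below first mirrors the `strings.LastIndex` parsing on a composite key, then shows one possible fix in which the full resource key is stored as the value next to the composite key, so nothing has to be parsed back out. The `prefixIndex` map is a dependency-free stand-in for the tree, and all names here are hypothetical.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// keyFromLastSlash mirrors the parsing in the snippet above: it keeps only
// the text after the final "/" of an "indexValue/resourceKey" composite key.
func keyFromLastSlash(composite string) string {
	idx := strings.LastIndex(composite, "/")
	if idx >= 0 && idx < len(composite)-1 {
		return composite[idx+1:]
	}
	return ""
}

// prefixIndex stands in for the radix tree: composite key -> resource key.
// Storing the resource key as the value avoids parsing it out of the key.
type prefixIndex map[string]string

func (p prefixIndex) add(indexValue, resourceKey string) {
	p[indexValue+"/"+resourceKey] = resourceKey
}

// keysByPrefix returns the stored resource keys whose composite key starts
// with prefix, sorted for deterministic output.
func (p prefixIndex) keysByPrefix(prefix string) []string {
	var keys []string
	for composite, resourceKey := range p {
		if strings.HasPrefix(composite, prefix) {
			keys = append(keys, resourceKey)
		}
	}
	sort.Strings(keys)
	return keys
}

func main() {
	// With a mesh/name resource key, the mesh segment is lost.
	fmt.Println(keyFromLastSlash("192.168.1.10/default/instance-1")) // instance-1

	idx := prefixIndex{}
	idx.add("192.168.1.10", "default/instance-1")
	fmt.Println(idx.keysByPrefix("192.168.")) // [default/instance-1]
}
```

With a real radix tree the same idea would apply: the value slot passed to the insert call can carry the resource key, and the walk callback can read it from its value argument instead of slicing the string key.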

if len(indexEntries) > 0 {
if err := tx.CreateInBatches(&indexEntries, 100).Error; err != nil {
logger.Warnf("failed to persist index entries during replace: %v", err)

Copilot AI Mar 8, 2026

During Replace, failure to persist resource_indices is only logged and the transaction still commits. That can leave follower replicas without usable persisted indexes until a full rebuild occurs. If persisted indexes are required for correctness in multi-replica mode, consider returning the error so the transaction rolls back (or at least make the behavior configurable).

Suggested change
- logger.Warnf("failed to persist index entries during replace: %v", err)
+ logger.Warnf("failed to persist index entries during replace: %v", err)
+ return err

Comment on lines +692 to +709
// Try to load from resource_indices table (persisted indices)
var entries []ResourceIndexModel
db := gs.pool.GetDB()
if err := db.Where("resource_kind = ?", gs.kind.ToString()).Find(&entries).Error; err != nil {
logger.Warnf("failed to load from resource_indices for %s: %v", gs.kind.ToString(), err)
return gs.rebuildIndicesFromResources()
}

// If resource_indices table has entries, rebuild from there
if len(entries) > 0 {
for _, entry := range entries {
gs.indices.AddEntry(entry.IndexName, entry.IndexValue, entry.ResourceKey)
}
logger.Infof("Rebuilt indices for %s from resource_indices table: %d entries", gs.kind.ToString(), len(entries))
return nil
}

// Fallback: rebuild from resources table

Copilot AI Mar 8, 2026

rebuildIndices rebuilds from resource_indices as soon as it finds any rows for the kind. If indexers have changed (new index added) or the table is only partially populated, this will skip rebuilding missing indexes from the resource table, leaving queries incomplete. Consider validating that required index names are present (or versioning the index schema) and falling back to rebuildIndicesFromResources when coverage is incomplete.

Suggested change
- // Try to load from resource_indices table (persisted indices)
- var entries []ResourceIndexModel
- db := gs.pool.GetDB()
- if err := db.Where("resource_kind = ?", gs.kind.ToString()).Find(&entries).Error; err != nil {
- 	logger.Warnf("failed to load from resource_indices for %s: %v", gs.kind.ToString(), err)
- 	return gs.rebuildIndicesFromResources()
- }
- // If resource_indices table has entries, rebuild from there
- if len(entries) > 0 {
- 	for _, entry := range entries {
- 		gs.indices.AddEntry(entry.IndexName, entry.IndexValue, entry.ResourceKey)
- 	}
- 	logger.Infof("Rebuilt indices for %s from resource_indices table: %d entries", gs.kind.ToString(), len(entries))
- 	return nil
- }
- // Fallback: rebuild from resources table
+ // Always rebuild from the resources table to ensure indices are complete
+ // and compatible with the current indexers. This will also repopulate
+ // the resource_indices table via persistIndexEntries.
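
As an alternative to always rebuilding, the coverage validation mentioned in the comment could look roughly like the sketch below: compare the index names required by the current indexers with the names present in the persisted rows, and fall back to the resources table when any are missing. The `indexEntry` type and `missingIndexes` helper are hypothetical. Note the limitation of this check: an index that legitimately has zero entries is indistinguishable from one that was never persisted, which is one reason a schema version may be the more robust option.

```go
package main

import "fmt"

// indexEntry is a pared-down stand-in for a persisted index row.
type indexEntry struct {
	IndexName   string
	IndexValue  string
	ResourceKey string
}

// missingIndexes returns the required index names that have no persisted
// entry at all, preserving the order of the required list.
func missingIndexes(required []string, entries []indexEntry) []string {
	seen := make(map[string]bool, len(entries))
	for _, e := range entries {
		seen[e.IndexName] = true
	}
	var missing []string
	for _, name := range required {
		if !seen[name] {
			missing = append(missing, name)
		}
	}
	return missing
}

func main() {
	entries := []indexEntry{
		{IndexName: "mesh", IndexValue: "default", ResourceKey: "default/instance-1"},
	}
	// "ip" stands for an indexer added after the rows above were written.
	if missing := missingIndexes([]string{"mesh", "ip"}, entries); len(missing) > 0 {
		fmt.Println("incomplete coverage, rebuild from resources; missing:", missing)
	}
}
```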

Comment on lines +794 to +804
// Create mock resources with IP-like values for prefix matching
mockRes1 := &mockResource{
kind: "TestInstance",
key: "instance-1",
mesh: "default",
meta: metav1.ObjectMeta{
Name: "instance-1",
Labels: map[string]string{
"ip": "192.168.1.10",
},
},

Copilot AI Mar 8, 2026

These HasPrefix tests use resource keys like "instance-1" that don't contain the path separator. In production ResourceKey() is typically mesh/name, so keys include /; using a realistic key format here would catch issues where prefix indexing/parsing mishandles / in resource keys.

Comment on lines 98 to +101
require (
cel.dev/expr v0.23.0 // indirect
filippo.io/edwards25519 v1.1.0 // indirect
github.com/BurntSushi/toml v1.3.2 // indirect
github.com/armon/go-radix v1.0.0 // indirect

Copilot AI Mar 8, 2026

github.com/armon/go-radix is imported directly from non-test code (pkg/store/memory/store.go), but in go.mod it is listed with // indirect. Running go mod tidy (or moving it to the direct require block) should make the dependency classification accurate.

@robocanic robocanic left a comment

Great work! I left some comments and hope we can discuss them.

@@ -201,6 +212,11 @@ func (gs *GormStore) Update(obj interface{}) error {
// Update indices: remove old and add new
gs.indices.UpdateResource(resource, oldResource.(model.Resource))

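Suggestion: Since this is a DB-backed store, the in-memory index (indices) does not need to be kept as well; all index storage can live in the DB. In other words, the resource_indices table in the DB can hold both the prefix-match indexes and the exact-match indexes.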

// This table stores index mappings to enable prefix queries and provide index persistence
// across multiple replicas in distributed deployments
// Table: resource_indices (shared across all resource kinds)
type ResourceIndexModel struct {

Suggestion: Add an Operator column; that would make the table more extensible.

3 participants