RANGER-3899: Policy creation/updation takes more time when there are more users,groups,roles by ramackri · Pull Request #962 · apache/ranger

ramackri · 2026-05-21T18:44:56Z

What changes were proposed in this pull request?

Optimized PolicyRefUpdater and related DAOs so policy create/update performs fewer database round-trips when a policy references many users, groups, and roles.

Problem

When creating or updating a policy with a large number of principals (users, groups, roles), Ranger Admin was slow because:

N+1 lookups — Each user, group, role, resource, access type, condition, and data mask type was resolved with a separate DB query (findByUserName, findByGroupName, findByRoleName, etc.).
Full ref-table rebuild on update — On policy update, cleanupRefTables() deleted all rows from x_policy_ref_user, x_policy_ref_group, and x_policy_ref_role (and other ref tables), then re-inserted every principal even when nothing changed.

Changes

Batch lookups via named queries — Added IN (:names) JPA named queries and DAO helpers to resolve principals and definitions in bulk:
- XXUser.getIdsByUserNames
- XXGroup.getIdsByGroupNames
- XXRole.getIdsByRoleNames
- XXResourceDef.findByNamesAndPolicyId
- XXAccessTypeDef.findByNamesAndServiceId
- XXPolicyConditionDef.findByServiceDefIdAndNames
- XXDataMaskTypeDef.findByNamesAndServiceId
Pre-resolved principal IDs — PolicyPrincipalAssociator now accepts a pre-fetched principal ID from the bulk map, avoiding per-name lookups when the principal already exists.
Selective cleanup on policy update — Introduced cleanupRefTablesForUpdate() that:
- Loads existing user/role/group ref rows for the policy
- Deletes only principals removed from the policy
- Skips re-insert for principals that are unchanged
- Still performs full delete/rebuild for resources, access types, conditions, and data mask types (typically far fewer entries)
Unified create/update flow — createNewPolMappingForRefTable() takes a new isCleanupRefTablesNeeded flag:
- Create: false (no prior ref rows)
- Update: true (runs selective cleanup before insert)
- ServiceDBStore no longer calls cleanupRefTables() separately before mapping
Batch insert helper — Added batchInsert() with bulk-mode handling and debug timing logs around dao.batchCreate().

Files changed (15)

Area	Files
Core logic	`PolicyRefUpdater.java`, `ServiceDBStore.java`
DAOs	`XXUserDao`, `XXGroupDao`, `XXRoleDao`, `XXResourceDefDao`, `XXAccessTypeDefDao`, `XXPolicyConditionDefDao`, `XXDataMaskTypeDefDao`, `XXPolicyRefUserDao`, `XXPolicyRefGroupDao`, `XXPolicyRefRoleDao`
Queries	`jpa_named_queries.xml`
Tests	`TestPolicyRefUpdater.java`, `TestServiceDBStore.java`

How was this patch tested?

Unit tests

Updated TestPolicyRefUpdater for the new bulk-lookup APIs and isCleanupRefTablesNeeded parameter
Added testCleanupRefTablesForUpdate_SelectivePrincipalCleanup to verify selective delete/insert for users, roles, and groups on update
Updated TestServiceDBStore for the new createNewPolMappingForRefTable signature

mvn -pl security-admin test -Dtest=TestPolicyRefUpdater,TestServiceDBStore

…more users,groups,roles

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

mneethiraj · 2026-05-21T20:21:48Z

        }
    }
+
+    public Map<String, XXResourceDef> findXXResourceDefsByNameAndPolicyId(Set<String> names, Long policyId) {


The caller of findXXResourceDefsByNameAndPolicyId() only needs resourceId (XXResourceDef.id) for each resource name. Retrieving only id from the database will help improve the performance.

I suggest updating & renaming the method from:

public Map<String, XXResourceDef> findXXResourceDefsByNameAndPolicyId(Set<String> names, Long policyId)

to:

public Map<String, Long> findResourceDefIdsByNameAndPolicyId(Set<String> names, Long policyId)

Update the named query XXResourceDef.findByNamesAndPolicyId to return id instead of XXResourceDef.

Review similar updates to following named queries and corresponding Dao methods that use them:

XXAccessTypeDef.findByNamesAndServiceId

XXPolicyConditionDef.findByServiceDefIdAndNames

XXDataMaskTypeDef.findByNamesAndServiceId

mneethiraj · 2026-05-21T20:43:32Z

+    public Map<String, XXResourceDef> findXXResourceDefsByNameAndPolicyId(Set<String> names, Long policyId) {
+        Map<String, XXResourceDef> ret = Collections.emptyMap();
+        if (policyId == null || CollectionUtils.isEmpty(names)) {
+            return ret;


Consider having a single return statement in a method.

mneethiraj · 2026-05-21T20:51:42Z

+                    .setParameter("policyId", policyId)
+                    .getResultList();
+
+            Map<String, Long> userNameIdMap = new HashMap<>();


To be consistent, consider using streams to convert results ito userNameIdMap - similar to XXRoleDao.getIdsByRoleNames().

mneethiraj · 2026-05-21T20:52:26Z

+                    .setParameter("policyId", policyId)
+                    .getResultList();
+
+            Map<String, Long> roleNameIdMap = new HashMap<>();


To be consistent, consider using streams to convert results ito roleNameIdMap - similar to XXRoleDao.getIdsByRoleNames().

mneethiraj · 2026-05-21T20:52:55Z

+                    .setParameter("policyId", policyId)
+                    .getResultList();
+
+            Map<String, Long> groupNameIdMap = new HashMap<>();


To be consistent, consider using streams to convert results ito groupNameIdMap - similar to XXRoleDao.getIdsByRoleNames().

mneethiraj · 2026-05-21T23:06:38Z

+                    .collect(Collectors.toSet());
+            Map<String, Long> roleNameIdMap = daoMgr.getXXRole().getIdsByRoleNames(filteredRoleNames);
+            for (String roleName : filteredRoleNames) {
+                PolicyPrincipalAssociator associator = new PolicyPrincipalAssociator(PRINCIPAL_TYPE.ROLE, roleName, xPolicy, roleNameIdMap.get(roleName), null, null, xPolRoles);


Instead of having PolicyPrincipalAssociator take 3 collection parameters and populating one of them in doAssociate() call, consider:

creating 3 classes PolicyRoleAssociator, PolicyGroupAssociator and PolicyUserAssociator

add method getPolicyRef() to return XXPolicyRefRole/XXPolicyRefGroup/XXPolicyRefUser when the policy exists

add method createPolicyRef() to create XXPolicyRefRole/XXPolicyRefGroup/XXPolicyRefUser when the policy exists

Long roleId = roleNameIdMap.get(roleName); PolicyRoleAssociator associator = new PolicyRoleAssociator(roleName, roleId, xPolicy); if (roleId != null) { XXPolicyRoleRef roleRef = associator.getPolicyRef(); if (roleRef != null) { xPolRoles.add(roleRef); } } else if (createPrincipalsIfAbsent) { rangerTransactionSynchronizationAdapter.executeOnTransactionCommit(associator); } else { VXResponse gjResponse = new VXResponse(); ... throw restErrorUtil.generateRESTException(gjResponse); }

Review handling of groups and users references as well - line 243 and 268.

mneethiraj · 2026-05-21T23:17:13Z

+        XXPolicyRefRoleDao xxPolicyRefRoleDao = daoMgr.getXXPolicyRefRole();
+        XXPolicyRefGroupDao xxPolicyRefGroupDao = daoMgr.getXXPolicyRefGroup();
+
+        Map<String, Long> policyUserNameIdMap   = xPolicyRefUserDao.findUserNameIdByPolicyId(policyId);


Consider moving lines 386, 387 and 388 into cleanupPolicyRefUsers(), cleanupPolicyRefRoles() and cleanupPolicyRefGroups() respectively - to improve readability.

RANGER-3899: Policy creation/updation takes more time when there are …

e369e4e

…more users,groups,roles

ramackri self-assigned this May 21, 2026

ramackri marked this pull request as draft May 21, 2026 18:45

ramackri requested review from kumaab, mneethiraj and pradeepagrawal8184 May 21, 2026 18:47

rameeshm requested a review from Copilot May 21, 2026 19:38

Copilot started reviewing on behalf of rameeshm May 21, 2026 19:38 View session

Copilot AI reviewed May 21, 2026

View reviewed changes

mneethiraj reviewed May 22, 2026

View reviewed changes

Fixing Review comments

c1222c6

Conversation

ramackri commented May 21, 2026

What changes were proposed in this pull request?

Problem

Changes

Files changed (15)

How was this patch tested?

Unit tests

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants