Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DocDB] Tserver error: Duplicate memory Tracker(id LogCache-228736c6d3e4459f921ab53bb938bca1) on parent LogCache->server->root #22401

Open
1 task done
shishir2001-yb opened this issue May 15, 2024 · 1 comment
Assignees
Labels
2024.1 Backport Required area/docdb YugabyteDB core features kind/bug This issue is a bug priority/high High Priority

Comments

@shishir2001-yb
Copy link

shishir2001-yb commented May 15, 2024

Jira Link: DB-11302

Description

Version: 2024.1.0.0-b123
Logs: https://drive.google.com/file/d/1GPXCLUnbwOLDvPhFtI-BrIBUUjFDhsgS/view?usp=sharing(4.4 GB) Check Jira to directly view logs

Encountered the following Tserver while running cross DB DDLs test with PITR and Backup/Restore.

(Universe logs -> 172.151.18.212 -> yb-tserver.ip-172-151-18-212.us-west-2.compute.internal.yugabyte.log.ERROR.20240502-214518.1267781)

E0502 21:45:18.905700 1268060 mem_tracker.cc:309] Duplicate memory tracker (id LogCache-228736c6d3e4459f921ab53bb938bca1) on parent LogCache->server->root

Test details:

Test Description:
        1. Create a cluster with required g-flags
        2. Start the cross DB DDL workload which will execute DDLs and DMLs across databases concurrently (50 colocated
           database and 100 non-colocated database), run this for 20-30 mins
        3. Create a PITR schedule on 10 random database
        4. Start a while loop and run it for 120 mins
          a. Note down time fr PITR(0) 
          b. Create a backup of 1 random database
          c. Start the cross DB DDL workload and stop it after 10 mins
          d. Note down the time for PITR(1)
          e. Start the cross DB DDL workload and run it for 10 mins
          f. Execute PITR on all 10 databases at random times(Between 1-9 sec ago).
          g. Restore to PITR(1)
          h. Validate data
          i. Restore to PITR(0) with a probability of 0.6 and validate data
          j. Delete the PITR schedule for the backup db 
          k. Drop the database 
          l. Restore the backup
          m. Create the snapshot schedule for this new DB

G-flags:

 tserver_gflags={
                "ysql_enable_packed_row": "true",
                "ysql_enable_packed_row_for_colocated_table": "true",
                "enable_automatic_tablet_splitting": "true",
                "ysql_max_connections": "500",
                'client_read_write_timeout_ms': str(30 * 60 * 1000),
                'yb_client_admin_operation_timeout_sec': str(30 * 60),
                "consistent_restore": "true",
                "ysql_enable_db_catalog_version_mode": "true",
                "tablet_replicas_per_gib_limit": 0,
                "ysql_pg_conf_csv": "yb_debug_report_error_stacktrace=true",
                "log_ysql_catalog_versions": "true"
            },
            master_gflags={
                "ysql_enable_packed_row": "true",
                "ysql_enable_packed_row_for_colocated_table": "true",
                "enable_automatic_tablet_splitting": "true",
                "consistent_restore": "true",
                "ysql_enable_db_catalog_version_mode": "true",
                "tablet_replicas_per_gib_limit": 0,
                "ysql_pg_conf_csv": "yb_debug_report_error_stacktrace=true",
                "log_ysql_catalog_versions": "true"
            }

Issue Type

kind/bug

Warning: Please confirm that this issue does not contain any sensitive information

  • I confirm this issue does not contain any sensitive information.
@shishir2001-yb shishir2001-yb added area/docdb YugabyteDB core features status/awaiting-triage Issue awaiting triage labels May 15, 2024
@yugabyte-ci yugabyte-ci added kind/bug This issue is a bug priority/medium Medium priority issue labels May 15, 2024
@yugabyte-ci yugabyte-ci added priority/high High Priority and removed priority/medium Medium priority issue labels May 15, 2024
@rthallamko3
Copy link
Contributor

Per @yusong-yan :
Customer impact: No impact to workload. Errors might be spewed to the log.

Engineering impact: We might lose memory tracking for certain LogCache structure of the tablets (Mostly the ones that got shutdown and bootstrapped again very quickly). Also, per tablet MemTracker might not account for the memory, and this can appear to look like a memory leak.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
2024.1 Backport Required area/docdb YugabyteDB core features kind/bug This issue is a bug priority/high High Priority
Projects
None yet
Development

No branches or pull requests

4 participants