garbage_collection
Implements the database gc subcommand for the Merlin CLI.
This module defines the DatabaseGarbageCollectionCommand class, which enables
automated cleanup of stale and orphaned database entries. The garbage collection
process identifies and removes runs with missing filesystem workspaces, workers
that no longer have valid runs, and studies without associated runs.
The command supports selective cleanup through entity-specific flags and includes a dry-run mode for previewing actions before execution. This ensures data integrity by maintaining consistency between database records and filesystem state.
DatabaseGarbageCollectionCommand
Bases: CommandEntryPoint
Handles the database gc subcommand, which runs garbage collection
across the database to remove runs without workspaces, orphaned workers,
and studies with no associated runs.
Methods:
| Name | Description |
|---|---|
add_parser |
Adds the |
process_command |
Processes the CLI input and runs garbage collection. |
Source code in merlin/cli/commands/database/garbage_collection.py
30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 | |
add_parser(subparsers)
Add the gc command parser to the CLI argument parser.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
subparsers
|
ArgumentParser
|
The subparsers object to which the |
required |
Source code in merlin/cli/commands/database/garbage_collection.py
process_command(args)
CLI command for running database garbage collection.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
args
|
Namespace
|
Parsed CLI arguments containing:
|
required |