Triage Summary – 2025-09-19
🧾 Triage Summary for issues
💤 Inactive issues (60+ days without updates)
These issues are inactive and have been marked with the stale
label:
- @kskovola Use json as default log format
- @kskovola Ensure CI stress test job logs properly track the test and indicate success/failure appropriately
- (unassigned) Fix usage of cta-admin options of type OptionStrList::FILE_ID
- (unassigned) Reduce restrictions imposed on repacked tape files being written to the CTA catalogue
- @afonso (confidential)
- (unassigned) Replace the json-c library with jsoncpp
- @nbugel Streamline image building using buildah
- @nbugel Investigate ERROR log messages occuring in CI
-
@afonso Improve
cta-admin sq
command with information about repack and verify requests -
(unassigned) Put
getUsername
andgetHostname
functions into common code which can be reused - (unassigned) Review and clean up Exceptions in CTA
-
@jleduc Move
cta-fst-gcd
from cta to ctaops repository - @afonso Properly report error in WFE when trying to archive a File ID that already exists in the CTA Catalogue
- @afonso Analyse missing repack sub-requests
- @kskovola missing support for CMD_VERSION in grpc-forntend
- @nbugel Improve reporting of errors in bash scripts
- @poliverc Make the use of LBP a drive configuration option
- (unassigned) Add CRC32c comparison values to the logs when the check fails
- @afonso Fix eos report log fields
-
@kskovola Replace
SslServerCredentials
withTlsServerCredentials
in gRPC frontend - @nbugel Create simple system test framework in Python
-
@kskovola CI failure in the
eosdf_systemtest
part oftest_client.sh
- @nbugel Make all logging functions noexcept
- @poliverc Fix SIGCHILD handling loop
-
@afonso Make
repack --justaddcopies
possible in ACTIVE tapes - @poliverc Fix taped uncaught exception when tape drive does not exist in catalogue
- @jchodak Failed to queue successfully transferred job batch to reporting stage
-
@kskovola Support passing
gtest_filter
flag inbuild_deploy
script - @nbugel Fix repack systemtest failing on timeout in move and add copies step
- (unassigned) Allow choosing between Oracle and Postgres RPMs
- @nbugel Ensure creation timestamps and modification timestamps are the same when a file is staged
- @nbugel Add system test for HTTP reading of tape-only replicas on EOS
- (unassigned) Allow the cta-admin tool to modify the physical library model and type
- @afonso cta-frontend fix log consistency for workflow logs
- (unassigned) Add CTA logger to drive::DriveGeneric logic
- @kskovola Update CI infrastructure and instructions for using custom EOS builds
- @poliverc cta-taped drive process coredumping
- @kskovola Add command-line option to override default config file for gRPC frontend
- @kskovola Use consistent terminology in gRPC frontend and workflow processing logic in frontend/common, for workflow events
- (unassigned) Cleanup unused files from the CTA project
- @nbugel Tape REST API - Implement "default" targeted metadata handling
- @nbugel Generate artifacts for manpage for eoscta-docs
- @mdavis Remove unused code
- (unassigned) CTA forgets files on disk and never moves them to tape
- @kskovola Reduce code duplication in tests such as GenericSchedulerTest
- @kskovola Implement a tool to list files on a tape (including the paths on EOS instance)
- @poliverc CTA-Taped should exit on failure to read sss keytab
- (unassigned) Make cta-admin JSON only
- @kskovola Statically link CTA-grpc version for cta-send-event
- @nbugel Change CMAKE_BUILD_TYPE to RelWithDebInfo
- @poliverc Improve 'disabled tape' reason message written by drive cleaner session
- (unassigned) Fix 'priority' column in 'cta-admin dr ls' command
- (unassigned) Add way to test liquibase migration scripts for Postgres in CI
-
@afonso Review
--partialtapesnumber
tapepool field - @poliverc Update provided taped service files
- (unassigned) All tape drive related tools must specify the drive name
- (unassigned) Fix wrong session ID 18446744073709551615 (overflow)
- (unassigned) Allow files to be evicted during a 'prepare abort'
- @ewv Increase CI block size to 1 MB
- (unassigned) Cleanup campaign for function name in execeptions
-
@poliverc
cta-taped
behavior when tape cannot be mounted - @poliverc Cleaner session should always eject the tape, even when it can not contact the catalogue or scheduler DB
- @kskovola Add configuration on CTA gRPC frontend to block user and/or repack operations
- (unassigned) Allow listing of current cta-frontend configuration with a dedicated command
- @afonso Review tickets without workflow and/or priority
- (unassigned) Deprecate '--fromcastor' flag
- (unassigned) Simplify 'eos evict' output message
- (unassigned) Harmonize logging fields (and values) across code base.
- @nbugel Refactor CTA catalogue library code
-
(unassigned) Protect tape deletions from
eos fs rm
- @poliverc Eject tape when a tape server is stopped.
- @idlarbi Standalone CLI tools don't support specifying a path for the cta-cli.conf file
- (unassigned) Improve handling of try-catch blocks of 'cta::exception::Exception'
- @afonso Evaluate and remove generic try-blocks from the CTA catalogue code
- @jleduc Protect CI runners from disk space over consumption
- @mdavis Remove Drive objects from objectstore schema
-
(unassigned) Investigate
cta-restore-deleted-files
rare segfaults - @kskovola Fix inconsistencies on the 'cta-restore-deleted-files' tool
- (unassigned) Improve error message returned to user on invalid input (uint64)
- @kskovola Improvements on the 'cta-restore-deleted-files' tool: more than one recycle bin file matches the parameters passed
-
(unassigned) Use
organisation
consistently in CTA software - @afonso Cleanup repack files when using 'cta-admin repack rm'
- (unassigned) Add a cta-admin option to invalidate cta-frontend caches
- (unassigned) Refactor RDBMS functions for catching SQL exceptions
- @poliverc Tape session should fail if encryption script returns non-zero exit code
- @afonso (confidential)
- @kskovola Add cta-admin system test for gRPC frontend
- @poliverc Disk read failures should not disable tapes or drives
- (unassigned) Reduce the number of queue objects fetched in 'fetchMountInfo()'
- @mdavis The RAO list contains duplicate entries of files to retrieve
- @kskovola Implement the streaming admin command into frontend/grpc...
- @kskovola Implement the non-streaming admin commands into frontend/grpc...
- @jleduc Remove ctareg credentials for CTA container registry/API access
- (unassigned) Large number of CRITICAL errors in cta-taped - "In BackendPopulator::~BackendPopulator(): error deleting agent (cta::exception::Exception). Backtrace follows."
- @afonso Improve "error_text" message in query prepare
- @afonso Check if QueueCleanupRunner is properly checking all replicas for a requeued request
- (unassigned) Update catalogue_schema.pdf
- @mdavis Get the CTA team involved in the MHVTL project development and release planning
- (unassigned) No tape candidate found while retrieve job batch requeue during repack
-
@afonso Sort out unlabelled issues (no
workflow
label) - @idlarbi Amend code convention: include headers should use the complete path from the project root
-
@vlado
cta-tape-label
should have clearer errors - (unassigned) Port CASTOR tape format/tape server documentation to EOSCTA docs
- @mdavis Update scheduler to use file creation time provided by frontend
- (unassigned) Huge/wrong value on youngest age of retrieve queue (cta-admin sq)
- (unassigned) Pass tapeserver::drive::DriveInterface as unique_ptr instead of reference
- (unassigned) Unit test for what happens during an archive mount if the tape becomes full
- @guenther Remove setDriveStatus() from OStoreDB
- @mdavis Integrate cta-frontend-grpc into CTA repo and RPMs
- @afonso Improve release workflow
- (unassigned) Don't queue reports with type NoReportRequired
- (unassigned) Detect and fix ambiguous partial template specialisation
- @idlarbi Add file verification workflow to CTA CI
- (unassigned) Monitor cta-frontend thread usage
- (unassigned) CTA Schema Version Management
- @poliverc Cataloging reasons for tape session failures
- (unassigned) Review scheduler retry logic for archive and retrieve
- @jleduc Do not queue file for retrieve if EOS size/checksum do not match CTA Catalogue
- (unassigned) Mount rule cache is not updated
- (unassigned) Tape server can only deal with archival errors that occur before writing the first block
- (unassigned) Use file creation time to improve data colocation during repack
- (unassigned) CI test the retry of a "layout only" conversion job
- (unassigned) Add test that file has been deleted from FST
- (unassigned) Harmonize retrieve/recall reporting interface for success.
- (unassigned) Risk of database deadlock during bulk insert to the catalogue
📝 Issues missing descriptions
These issues have no description and need clarification:
- (unassigned) Avoid code duplication where possible between WFE request --> queueRetrieve --> insert() and RepackRequest --> RetrieveRequest --> insert.
- @ewv Fix Enstore-reading Sonarcloud issues
- @kskovola Ensure CI stress test job logs properly track the test and indicate success/failure appropriately
- @guenther Implement repack queueing for Relational DB scheduler backend
- @guenther [CI] Investigate why ctaStorageClass_3_copy has numberofcopies set to 1 while having 3 defined copyNbs in archiveroute
-
@kskovola Replace
SslServerCredentials
withTlsServerCredentials
in gRPC frontend - @guenther Report remaining "successful" job batch in buffer as failed after exception is thrown in TapeWriteSingleThread
- @guenther Revise all places where jobs need to be returned (releaseJob()) to the JobPool, especially for requeueing and failures
-
@kskovola Authentication between CTA gRPC Frontend, EOS and
cta-admin
-
@kskovola Support passing
gtest_filter
flag inbuild_deploy
script - @kskovola Add command-line option to override default config file for gRPC frontend
- @guenther Make number of connections for Scheduler DB Init configurable
- @guenther Review code to use m_* for private members
- @guenther Implement getNextFailedArchiveRepackReportBatch
- @guenther Implement getNextFailedRetrieveRepackReportBatch
- @guenther Implement getNextSuccessfulRetrieveRepackReportBatch
- @guenther Refine implementation of getNextArchiveJobsToReportBatch
- @afonso Review tickets without workflow and/or priority
🙋 Unassigned issues older than 2 weeks
These issues are old and still unassigned:
– Avoid code duplication where possible between WFE request --> queueRetrieve --> insert() and RepackRequest --> RetrieveRequest --> insert.
– Make mhVTL usable for scaling tests with containerised environment
– Summary of issues with MHVTL
– Fix usage of cta-admin options of type OptionStrList::FILE_ID
– Reduce restrictions imposed on repacked tape files being written to the CTA catalogue
– Replace the json-c library with jsoncpp
– Gracefully finish drive operations on demand and put the drive down
– Put getUsername
and getHostname
functions into common code which can be reused
– Strategy to handle small files on CTA
– Remove or finish implementation of cta-admin commands which are not functional
– Review and clean up Exceptions in CTA
– Add CRC32c comparison values to the logs when the check fails
– Clarifying the meaning of 'instance' word throughout the code, logging and command outputs.
– Allow choosing between Oracle and Postgres RPMs
– Allow the cta-admin tool to modify the physical library model and type
– Add CTA logger to drive::DriveGeneric logic
– Cleanup unused files from the CTA project
– CTA forgets files on disk and never moves them to tape
– Make cta-admin JSON only
– Fix 'priority' column in 'cta-admin dr ls' command
– Add way to test liquibase migration scripts for Postgres in CI
– All tape drive related tools must specify the drive name
– Fix wrong session ID 18446744073709551615 (overflow)
– Allow files to be evicted during a 'prepare abort'
– Cleanup campaign for function name in execeptions
– Allow listing of current cta-frontend configuration with a dedicated command
– Deprecate '--fromcastor' flag
– Simplify 'eos evict' output message
– Harmonize logging fields (and values) across code base.
– Protect tape deletions from eos fs rm
– Improve handling of try-catch blocks of 'cta::exception::Exception'
– Investigate cta-restore-deleted-files
rare segfaults
– Improve error message returned to user on invalid input (uint64)
– Use organisation
consistently in CTA software
– Add a cta-admin option to invalidate cta-frontend caches
– Refactor RDBMS functions for catching SQL exceptions
– Reduce the number of queue objects fetched in 'fetchMountInfo()'
– Large number of CRITICAL errors in cta-taped - "In BackendPopulator::~BackendPopulator(): error deleting agent (cta::exception::Exception). Backtrace follows."
– Update catalogue_schema.pdf
– No tape candidate found while retrieve job batch requeue during repack
– Port CASTOR tape format/tape server documentation to EOSCTA docs
– Huge/wrong value on youngest age of retrieve queue (cta-admin sq)
– Pass tapeserver::drive::DriveInterface as unique_ptr instead of reference
– Unit test for what happens during an archive mount if the tape becomes full
– Don't queue reports with type NoReportRequired
– Detect and fix ambiguous partial template specialisation
– Monitor cta-frontend thread usage
– CTA Schema Version Management
– Review scheduler retry logic for archive and retrieve
– Mount rule cache is not updated
– Tape server can only deal with archival errors that occur before writing the first block
– Use file creation time to improve data colocation during repack
– CI test the retry of a "layout only" conversion job
– Add test that file has been deleted from FST
– Harmonize retrieve/recall reporting interface for success.
– Risk of database deadlock during bulk insert to the catalogue
🗣️ Issues needing discussion but inactive
These issues are marked needs discussion
and haven't been updated in 14+ days:
- @afonso CTA SonarCloud Hackaton
- @poliverc Add mechanism to list the differences between a tape server current configuration and the corresponding configuration file
Generated by the GitLab Triage Bot on 2025-09-19