post-hackathon + template merge + modules/subworkflows update #98
vagkaratzas merged 105 commits into main from dev
Conversation
post-release version bump
… to nextflow.config
…duction full test correct input samplesheet path, typos fix, zenodo doi added…
Important! Template update for nf-core/tools v4.0.2
update nf-core modules and subworkflows to latest
pre-release v bump
Aratz
left a comment
Looks good! Just had some minor suggestions for you to address in this version or the next one.
You can also generate such `YAML`/`JSON` files via [nf-core/launch](https://nf-co.re/launch).
`## Functional Annotation Options`
Here you could add a similar section for domain annotation tools
It is a bit more straightforward than Functional Annotation, but I agree that a couple of sentences with the database-enabling parameters would make sense; either now or in the future. Noted.
pinin4fjords
left a comment
AI-assisted review (Claude, on behalf of @pinin4fjords).
This is a release PR (dev -> main) bundling a 4.0.2 template merge, modules/subworkflows update, and the new metagRoot domain annotation. Most issues below are doc/schema polish; one (H1) breaks the AWS full test on default params.
Findings
- 1 high (broken full-test URL)
- 6 medium inline + 2 medium below (schema typos / orphan section / unused module / hmmer description)
- 1 low (functional-annotation snapshot wraps a boolean)
- 3 informational (IPS binary/data version skew, full-test still uses test-sized data, empty PR description)
All claims grepped against commit 04b0928. Nothing here blocks merge on its own; H1 is the only one I'd want fixed before tagging v1.1.0.
Additional findings on lines outside this PR's diff
These touch lines the PR didn't change, so GitHub won't accept them as inline comments. Listing them here so the polish can land alongside the rest.
M5 (medium) - docs/output.md:117 - .gff should be .gff3
The InterProScan module emits *.gff3 (modules/nf-core/interproscan/main.nf:18) and the snapshot confirms .gff3 is what publishes under functional_annotation/interproscan/<sample>/. The line currently reads `<samplename>.gff`: general feature format (GFF) file, and should read `<samplename>.gff3`.
M7 (medium) - docs/output.md:392-402 - orphan "SeqKit stats" section
Documents a seqkit/<prefix>.tsv output that the pipeline does not produce. grep -rn SEQKIT_STATS only matches files inside modules/nf-core/seqkit/stats/ itself - the module is installed but never imported or invoked anywhere (see also M8 inline on modules.json). The QC TSVs that are produced come from SeqFu and are already documented in the SeqFu section above. Either remove this section, or wire SEQKIT_STATS into FAA_SEQFU_SEQKIT if it was meant to be added.
M1 (medium) - nextflow_schema.json:347 - typo in interproscan_enableprecalc help_text
---diasable-precalc should be --disable-precalc (three dashes, plus "diasable" misspelled). The actual InterProScan flag is --disable-precalc, which is what conf/modules.config:173 correctly passes - so this is purely cosmetic, but it does end up in --help output and the parameter docs site.
M4 (medium) - nextflow_schema.json:319-323 - skip_interproscan description is inverted
For a skip_* flag, "Run InterProScan" reads the wrong way around. Match the wording style of skip_pfam/skip_funfam/etc.: "description": "Skip the functional annotation with InterProScan.". Also the explicit "default": false is unique to this entry among the skip flags - either drop it or add it consistently.
I3 (informational) - empty PR description
The PR body is just the unfilled checklist. For a release-target PR (dev -> main), a 3-4 line summary of what's bundled (template 4.0.2, modules sync, metagRoot, NMPFams) helps reviewers and feeds release notes.
nmpfams_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/nmpfams/nmpfamsdb_test.hmm.gz'
metagroot_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/metagroot/metagroot_test.hmm.gz'
// Functional annotation
interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan_test.tar.gz'
Severity: high - this URL 404s on the test-datasets repo, so -profile test_full (i.e. the AWS full test) will fail at the InterProScan database download step.
Verified:
…/proteinannotator/testdata/interproscan_test.tar.gz -> 404
…/proteinannotator/testdata/interproscan/interproscan_test.tar.gz -> 200 (the path conf/test.config:33 already uses)
The samplesheet path in this file was fixed in this PR (#85), but this URL was missed.
Suggested change:
- interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan_test.tar.gz'
+ interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan/interproscan_test.tar.gz'
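For future release checks, the 404/200 probe is easy to script. A sketch, assuming the default nf-core test-datasets raw URL as the base path (verify it against `params.pipelines_testdata_base_path` in nextflow.config); the curl probes are commented out since they need network access:

```shell
# Assumed base: default nf-core test-datasets raw URL.
base='https://raw.githubusercontent.com/nf-core/test-datasets'
broken="${base}/proteinannotator/testdata/interproscan_test.tar.gz"
fixed="${base}/proteinannotator/testdata/interproscan/interproscan_test.tar.gz"

# Uncomment to probe over the network:
# curl -s -o /dev/null -w '%{http_code}\n' "$broken"   # reported 404 at review time
# curl -s -o /dev/null -w '%{http_code}\n' "$fixed"    # reported 200 at review time
echo "$fixed"
```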
"type": "string",
"format": "file-path",
"description": "Path to an already installed NMPFams HMM database.",
"help_text": "If left null and skip_funfam is false, the pipeline will start downloading the latest FunFam HMM library."
Severity: medium - copy-paste from the FunFam block. The help_text refers to skip_funfam and "FunFam HMM library" but this is the nmpfams_db parameter.
Suggested change:
- "help_text": "If left null and skip_funfam is false, the pipeline will start downloading the latest FunFam HMM library."
+ "help_text": "If left null and skip_nmpfams is false, the pipeline will start downloading the latest NMPFams HMM library."
"nmpfams_latest_link": {
"type": "string",
"default": "https://pavlopoulos-lab.org/envofams/databases/hmmer/nmpfamsdb.hmm.gz",
"description": ""
Severity: medium - description is empty for nmpfams_latest_link. The other *_latest_link entries (pfam, funfam, metagroot) all have one. Suggested fill (mirroring the metagroot wording):
Suggested change:
- "description": ""
+ "description": "NMPFams hosted link to the latest NMPFams HMM database file."
- Each of the `domain_annotation/` subfolders (e.g., `pfam`, `funfam`) contain a `.domtbl.gz` annotation file per input sample, depending on which domain annotation databases were used in the pipeline execution.
+ Each of the `domain_annotation/` subfolders (e.g., `pfam`, `funfam`, `nmpfams`, `metagroot`) contain a `.domtbl.gz` annotation file per input sample, depending on which domain annotation databases were used in the pipeline execution.
[hmmer](https://github.com/EddyRivasLab/hmmer) is a fast and flexible alignment trimming tool that keeps phylogenetically informative sites and removes others.
Severity: medium - this sentence describes trimAl, not hmmer. hmmer is a profile-HMM-based sequence search tool. Looks like leftover copy-paste from another pipeline's output docs.
Suggested change:
- [hmmer](https://github.com/EddyRivasLab/hmmer) is a fast and flexible alignment trimming tool that keeps phylogenetically informative sites and removes others.
+ [hmmer](https://github.com/EddyRivasLab/hmmer) (HMMER) is a sequence search tool that uses profile hidden Markov models (profile HMMs) to identify homologous sequences against curated databases such as Pfam, FunFam, NMPFams and metagRoot.
"seqkit/stats": {
"branch": "master",
- "git_sha": "28935b89b7e1f19e835f8c6e4c8322d4b505dded",
+ "git_sha": "6d46786420b4d7bc88eba026eb389c0c5535d120",
"installed_by": ["modules"]
},
Severity: medium - seqkit/stats is installed (installed_by: ["modules"]) but never imported anywhere in the workflow. grep -rn SEQKIT_STATS only hits files inside modules/nf-core/seqkit/stats/ itself.
Either remove with nf-core modules remove seqkit/stats (which also lets you drop docs/output.md's orphan SeqKit-stats section, see M7 in the review body), or import and use it. Carrying an unused module bloats the snapshot footprint and the docs drift.
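The same installed-but-unused check generalizes to any module. A minimal sketch: the only input is the module path; the grep at the end is illustrative and assumes it is run from the pipeline root (only the name derivation is executed here):

```shell
# Map an nf-core module path to its default process name,
# e.g. seqkit/stats -> SEQKIT_STATS.
mod='seqkit/stats'
proc=$(printf '%s' "$mod" | tr '/a-z' '_A-Z')
echo "$proc"

# Illustrative usage (run from the pipeline root); zero hits outside
# modules/nf-core/ means the module is installed but never imported:
# grep -rn "include.*${proc}" --include='*.nf' . | grep -v 'modules/nf-core/'
```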
{ assert snapshot(
    path(workflow.out.interproscan_tsv[0][1]).readLines()[0]
        .contains("GI|225038609|EFDID|719595|FULL 079fff43a0270e432d339ea71b6f0acf 350 SFLD SFLDS00057 Glutaminase/Asparaginase 17 347 0.0 T")
).match()}
Severity: low - the snapshot value here is just true (see main.nf.test.snap:4), because snapshot() is being passed the result of a .contains() boolean check. That means the test only catches a regression where line 0 of the TSV stops containing that exact substring - any other change to the TSV body slides past silently. None of the rest of the channel content is snapshotted either.
Suggested split: assert the contains-check directly, and snapshot the actual TSV channel separately.
Suggested change:
- { assert snapshot(
-     path(workflow.out.interproscan_tsv[0][1]).readLines()[0]
-         .contains("GI|225038609|EFDID|719595|FULL 079fff43a0270e432d339ea71b6f0acf 350 SFLD SFLDS00057 Glutaminase/Asparaginase 17 347 0.0 T")
- ).match()}
+ then {
+     assertAll(
+         { assert workflow.success },
+         { assert path(workflow.out.interproscan_tsv[0][1]).readLines()[0]
+             .contains("GI|225038609|EFDID|719595|FULL\t079fff43a0270e432d339ea71b6f0acf\t350\tSFLD\tSFLDS00057\tGlutaminase/Asparaginase\t17\t347\t0.0\tT") },
+         { assert snapshot(workflow.out.interproscan_tsv).match() }
+     )
+ }
The TSV is not matching; I'll keep it as it is for now.
conda "${moduleDir}/environment.yml"
- container "${ workflow.containerEngine == 'singularity' && !task.ext.singularity_pull_docker_container ?
+ container "${ workflow.containerEngine in ['singularity', 'apptainer'] && !task.ext.singularity_pull_docker_container ?
    'https://depot.galaxyproject.org/singularity/interproscan:5.59_91.0--hec16e2b_1' :
Severity: informational (pre-existing, but ships with this v1.1.0 tag).
The module pins interproscan:5.59_91.0--hec16e2b_1, but the default interproscan_db_url in nextflow.config:38 is interproscan-5.72-103.0, and docs/usage.md:91-94 instructs users to download v5.72-103.0 manually. InterProScan data files are not forward/backward compatible across binary versions, so default-param users will hit a runtime mismatch. CI passes because the test fixture is version-matched to 5.59.
Not introduced by this PR, but the v1.1.0 tag here is a reasonable trigger to either (a) pin the default URL to a 5.59-91.0 tarball, or (b) bump the binary to a 5.72-compatible biocontainer when one becomes available, and update docs/usage.md to match.
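This kind of skew is also easy to catch mechanically. A hedged sketch: the container tag is the one pinned in the module, while the tarball filename is only illustrative of the InterProScan release naming pattern, not copied from the config:

```shell
# Compare the data version embedded in the biocontainer tag
# (format interproscan:<major>_<minor>--<build>) with the one in
# the database tarball name.
container='interproscan:5.59_91.0--hec16e2b_1'
db_tarball='interproscan-5.72-103.0-64-bit.tar.gz'   # illustrative filename

bin_ver=$(printf '%s' "$container" | sed -E 's/.*:([0-9.]+)_([0-9.]+)--.*/\1-\2/')
db_ver=$(printf '%s' "$db_tarball" | grep -oE '[0-9]+\.[0-9]+-[0-9]+\.[0-9]+')

if [ "$bin_ver" = "$db_ver" ]; then
  echo "match: $bin_ver"
else
  echo "mismatch: binary $bin_ver vs database $db_ver"
fi
```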
InterProScan is a difficult case; the final version (no more updates coming) with a conda build is stuck on 5.59_91.0. However, the team has been developing newer versions (a whole Nextflow, non-nf-core, pipeline) and we aim at some point to move the nf-core module there, maybe. For now these two versions (module + database) seem compatible, so keeping it like this.
metagroot_latest_link = params.pipelines_testdata_base_path + 'proteinannotator/testdata/metagroot/metagroot_test.hmm.gz'
// Functional annotation
interproscan_db_url = params.pipelines_testdata_base_path + 'proteinannotator/testdata/interproscan_test.tar.gz'
interproscan_applications = 'Hamap,TIGRFAM,sfld'
Severity: informational - after fixing H1 above, test_full would still point at the same test-sized HMM and InterProScan archives as conf/test.config. The CHANGELOG entry "test_full.config input samplesheet path is now set properly" suggests this is intentional for now, but it means the AWS full test isn't actually exercising a full-size workload. If a real full-size dataset is planned, that's a follow-up - just flagging that test_full isn't currently distinct from test.config.
Nothing planned so far. Will need to carefully come up with a different full-size dataset when the pipeline is more mature.
pinin4fjords
left a comment
AI-assisted follow-up review (Claude, on behalf of @pinin4fjords). Snapshot-hygiene suggestion on the domain_annotation tests.
{ assert snapshot(
    path(workflow.out.nmpfams_domains[0][1]).linesGzip[0..7],
    workflow.out.versions.collect { path(it).yaml }.unique()
).match()}
)
Severity: low - the four linesGzip[0..7] assertions in this file (lines 36/73/109/145) inline raw rows into the snapshot. Same coverage, but a hash in the .snap instead of raw content - anchored here on the nmpfams block since the others are outside the diff:
Suggested change:
- { assert snapshot(
-     path(workflow.out.nmpfams_domains[0][1]).linesGzip[0..7],
-     workflow.out.versions.collect { path(it).yaml }.unique()
- ).match()}
- )
+ then {
+     assertAll(
+         { assert workflow.success},
+         { assert snapshot(
+             path(workflow.out.nmpfams_domains[0][1]).linesGzip[0..7].join('\n').md5(),
+             workflow.out.versions.collect { path(it).yaml }.unique()
+         ).match()}
+     )
+ }
Same swap applies to the three other linesGzip[0..7] blocks.
I actually don't mind seeing the actual lines of characters in the snapshots, especially if it is only a couple of lines like here, rather than md5sums. Could have both I guess, but for now I'll leave it as is. Interested in documentation links if there are new md5sum guidelines that I've missed!
pinin4fjords
left a comment
Nothing huge here. The AI found one critical issue and a load of things I think it would be worth you fixing. I also think you could use .md5() to keep the snapshots tidier.
Trust you to resolve as appropriate!
Added

- `metagRoot` HMM library (or use path to an existing one) for domain annotation. (by @angelphanth)
- `NMPFams` HMM library (or use path to an existing one) for domain annotation. (by @npechl)
- … `nextflow.config`. (by @vagkaratzas)

Changed

- `test_full.config` input samplesheet path is now set properly. (by @vagkaratzas)

Dependencies

PR checklist
- … `nf-core pipelines lint`).
- … `nf-test test */local --profile=~test,docker` for all new local tests).
- … `nf-test test */local --profile=~test,docker,debug`).
- `docs/usage.md` is updated.
- `docs/output.md` is updated.
- `CHANGELOG.md` is updated.
- `README.md` is updated (including new tool citations and authors/contributors).