Command Dependency

Command Dependency (CmdDependency) is the external dependent program that can be used in Command Program (CmdProgram).

Create

CmdDependency(;
    exec::Base.AbstractCmd=``,
    test_args::Base.AbstractCmd=``,
    validate_success::Bool=false,
    validate_stdout::Function=do_nothing,
    validate_stderr::Function=do_nothing,
    exit_when_fail::Bool=true
)

exec::AbstractCmd: the command to call the dependency.
test_args::AbstractCmd: for testing purposes, the command to be appended to exec.
validate_success::Bool: when checking the dependency, whether to validate the exit code == 0.
validate_stdout::Function: a function takes standard out as String and return the validation result as ::Bool.
validate_stderr::Function: a function takes standard error as String and return the validation result as ::Bool.
exit_when_fail::Bool: if validation fails, whether to throw error and exit.

Example

julia = CmdDependency(
    exec = `julia`,
    test_args = `--version`,
    validate_success = true,
    validate_stdout = x -> occursin(r"^julia version", x),
    validate_stderr = do_nothing,
    exit_when_fail = true
)

Check

CmdDependency can be checked manually by using

check_dependency(p::CmdDependency)

Usually, it is not necessary when you specify CmdDependency in CmdProgram because dependency check will be done when running CmdProgram by default.

Use with `CmdProgram`

When building CmdProgram, we can specify our command dependencies by using

dep1 = CmdDependency(...)
dep2 = CmdDependency(...)

prog = CmdProgram(
    cmd_dependencies = [dep1, dep2],
    ...
)

To call the dependency in command, we can prepend a dollor sign ($):

prog = CmdProgram(
    cmd_dependencies = [dep1, dep2],
    cmd = `$dep1 --args` & `$dep2 --args`
)

An Example

We use the same example in Command Program page.

Adding samtools and bowtie2 as the dependencies of the bowtie2 mapping program:

SAMTOOLS = CmdDependency(
    exec = `samtools`,
    test_args = `--version`,
    validate_success = true,
    validate_stdout = x -> occursin(r"^samtools \d+", x)
)

BOWTIE2 = CmdDependency(
    exec = `bowtie2`,
    test_args = `--version`,
    validate_success = true,
    validate_stdout = x -> occursin(r"bowtie2-align-s version \d+", x)
)

program_bowtie2 = CmdProgram(
    name = "Bowtie2 Mapping",
    id_file = ".bowtie2",

    cmd_dependencies = [SAMTOOLS, BOWTIE2],

    inputs = ["FASTQ", "REF"],
    validate_inputs = inputs -> begin
        check_dependency_file(inputs["FASTQ"]) &&
        check_dependency_file(inputs["REF"])
    end,

    prerequisites = (inputs, outputs) -> begin
        mkpath(dirname(to_str(outputs["BAM"])))
    end,

    outputs = ["BAM"],
    infer_outputs = inputs -> begin
        Dict("BAM" => str(inputs["FASTQ"]) * ".bam")
    end,
    validate_outputs = outputs -> begin
        check_dependency_file(outputs["BAM"])
    end,

    cmd = pipeline(`$BOWTIE2 -x REF -q FASTQ`, `$SAMTOOLS sort -O bam -o BAM`),

    wrap_up = (inputs, outputs) -> run(`$SAMTOOLS index $(outputs["BAM"])`)
)

To run the program, we can decide whether to check dependencies with run(..., check_dependencies=true):

inputs = Dict("FASTQ" => "a.fastq", "REF" => "ref.fasta")
success, outputs = run(program_bowtie2, inputs; check_dependencies = true)