Thank you for your effort for this impressive tool. I'm running this tool with multiple 10x genomics single cell data. However, the process is suddenly halted before the step for DGE and differential isoform usage.
This is my full script to run SCALPEL:
#!/bin/bash
# Set directory to save temporal files
# (if these are not set, then nextflow make those files in home directory - memory issue)
export NXF_HOME=/node200data/dlchdms0109/.nextflow
export NXF_JAVA_TMPDIR=/node200data/dlchdms0109/tmp
export NXF_CONDA_CACHEDIR=/node200data/dlchdms0109/conda
# clean cache first
rm -rf /node200data/dlchdms0109/.nextflow/assets/
/home/dlchdms0109/.local/bin/nextflow run /home/dlchdms0109/Isoform_imputation/sr_tx_analysis/scalpel/SCALPEL/main.nf \
-work-dir /node200data/dlchdms0109/run_SCALPEL/work \
--sequencing chromium \
--samplesheet /node200data/dlchdms0109/run_SCALPEL/samplesheet_for_SCALPEL.csv \
--transcriptome /home/dlchdms0109/Isoform_imputation/Reference_genome/gencode.v49.transcripts.fa.gz \
--gtf /home/dlchdms0109/Isoform_imputation/Reference_genome/gencode.v49.primary_assembly.annotation.gtf \
--ipdb /home/dlchdms0109/Isoform_imputation/Reference_genome/ipdb/hg38_ipriming_sites.bed.tar.gz \
--cpus 30 \
-with-conda /home/dlchdms0109/Isoform_imputation/sr_tx_analysis/scalpel/SCALPEL/requirements.yml \
--outputDir /node200data/dlchdms0109/run_SCALPEL/scalpel_output
I describe my sample sheet file below:
PO13,/node200data/dhkim1674/SC_SR_Samsung_Alzheimer/fastq/PO_/Samsung_hBrain_PO13_NC_GEX/Samsung_hBrain_PO13_NC_GEX_S1_R1_001.fastq.gz,/node200data/dhkim1674/SC_SR_Samsung_Alzheimer/fastq/PO_/Samsung_hBrain_PO13_NC_GEX/Samsung_hBrain_PO13_NC_GEX_S1_R2_001.fastq.gz,/node200data/dhkim1674/SC_SR_Samsung_Alzheimer/cellranger_output/PO13/
When I checked the output directory, iDGE_seurat.RDS and DIU_table.csv are not made.
This is the error state when I run the tool:
Error in `vec_rep_each()`:
! `times` can't be missing. Location 1 is missing.
Backtrace:
▆
1. ├─... %>% ...
2. ├─tidyr::pivot_wider(...)
3. ├─tidyr:::pivot_wider.data.frame(...)
4. │ └─tidyr::pivot_wider_spec(...)
5. │ └─vctrs::vec_rep_each(fill, nrow * ncol)
6. └─rlang::abort(message = message, call = call)
Warning message:
In nrow * ncol : NAs produced by integer overflow
Execution halted
I didn't understand why those processes are suddenly interrupted
Thank you for your effort for this impressive tool. I'm running this tool with multiple 10x genomics single cell data. However, the process is suddenly halted before the step for DGE and differential isoform usage.
This is my full script to run SCALPEL:
I describe my sample sheet file below:
When I checked the output directory, iDGE_seurat.RDS and DIU_table.csv are not made.
This is the error state when I run the tool:
I didn't understand why those processes are suddenly interrupted