Author in the catalog

jaechang-hits

202 skills37,572 stars totalgithub.com/jaechang-hits

Published skills

Showing 48 of 202

lamindb-data-management

An open-source FAIR biology data framework that versions artifacts, tracks lineage, validates via ontologies (Bionty), and queries datasets. It integrates with Nextflow, Snakemake, W&B, and scVI, and recommends scanpy for scRNA-seq and bionty for ontology lookups.

Dados e Análise#aiby jaechang-hits

cellchat-cell-communication

186

Infer and visualize intercellular communication from scRNA-seq data using CellChat (R) or liana (Python). The workflow involves building CellChat, analyzing ligand-receptor pairs, gene expression, communication probabilities, pathway signaling, and network centrality, with visualizations and cross-condition comparisons for human and mouse data.

Design e Frontend#python#aiby jaechang-hits

smina-molecular-docking

186

smina is a molecular docking CLI, a fork of AutoDock Vina featuring customizable scoring functions, native SDF/MOL2/PDB ligand input, autoboxing, and local energy minimization. Choose smina over Vina when you need customization in molecular docking.

DevOps e Infra#aiby jaechang-hits

torchdrug

186

A PyTorch-based ML platform for drug discovery, supporting graph molecular representation learning, property prediction (ADMET, activity), retrosynthesis, and drug-target interaction. It offers GNN layers, pretrained models, and benchmark datasets.

Dados e Análise#aiby jaechang-hits

unichem-database

186

Cross-reference compound IDs across 20+ databases (e.g., ChEMBL, DrugBank) via UniChem REST API, resolving InChIKeys to source IDs, translating between source-specific IDs, and finding structurally related compounds by connectivity. All cross-reference queries use POST with a JSON body, except for /sources (GET), and no authentication is required.

Dados e Análise#ai#apiby jaechang-hits

zinc-database

186

Query ZINC15/ZINC22 virtual compound libraries (1.4B compounds, 750M purchasable) to search for lead, fragment, or drug-like compounds by MW, logP, reactivity, or SMILES similarity, and download 3D sets for docking. For bioactivity, use chembl-database-bioactivity; for approved drugs, use drugbank-database-access.

Dados e Análise#ai#reactby jaechang-hits

brenda-database

186

BRENDA Enzyme DB offers SOAP/REST queries for over 80,000 enzymes and 7 million values, covering kinetic parameters (Km, Vmax, kcat, Ki), EC classes, substrate specificity, inhibitors, cofactors, and organism data. Free academic registration is available, and it integrates with cobrapy-metabolic-modeling and hmdb-database.

Dados e Análise#aiby jaechang-hits

pysam-genomic-files

186

Read/write SAM/BAM/CRAM, VCF/BCF, FASTA/FASTQ. Region queries, pileup, variant filtering, read groups. Python htslib wrapper exposing samtools/bcftools CLI. Use STAR/BWA for alignment; GATK/DeepVariant for variant calling.

DevOps e Infra#python#aiby jaechang-hits

samtools-bam-processing

186

CLI toolkit for SAM/BAM/CRAM: sort, index, convert, filter, QC alignments. Core commands: view, sort, index, flagstat, stats, depth, markdup, merge. Required between alignment and variant/peak calling. Use pysam for Python-native BAM access; deeptools for normalized coverage tracks.

Design e Frontend#python#aiby jaechang-hits

pylabrobot

186

Hardware-agnostic Python liquid-handler library: portable scripts run on Hamilton STAR, Tecan Freedom EVO, Opentrons OT-2, or a simulator without vendor lock-in. For protocol automation, method dev, plate reformatting, serial dilutions, and Python lab workflows.

Automação#python#aiby jaechang-hits

star-rna-seq-aligner

186

Splice-aware RNA-seq aligner producing sorted BAM and splice junction tables. Builds genome index, runs two-pass alignment for better junctions. Outputs sorted BAM, junctions (SJ.out.tab), stats (Log.final.out), optional gene counts. Use Salmon for fast pseudoalignment; STAR when a BAM is needed for variant calling, IGV, or ENCODE pipelines.

Design e Frontend#aiby jaechang-hits

statsmodels-statistical-modeling

186

Python statistical modeling: regression (OLS, WLS, GLM), discrete (Logit, Poisson, NegBin), time series (ARIMA, SARIMAX, VAR), with rigorous inference, diagnostics, and hypothesis tests. Use scikit-learn for ML; statistical-analysis for test choice.

Desenvolvimento#git#pythonby jaechang-hits

cellpose-cell-segmentation

186

DL cell/nucleus segmentation for fluorescence and brightfield microscopy. Pre-trained models (cyto3, nuclei, tissuenet) and a generalist flow-based algorithm segment cells without retraining. Outputs label masks for morphology and tracking. Use scikit-image watershed for rule-based; Cellpose when DL generalization across staining is needed.

Escrita e Conteúdo#aiby jaechang-hits

histolab-wsi-processing

186

WSI processing for digital pathology. Tissue detection, tile extraction (random, grid, score-based), filter pipelines for H&E/IHC. For dataset prep, tile-based DL, slide QC. Use pathml for multiplexed imaging.

Dados e Análise#git#aiby jaechang-hits

pyimagej-fiji-bridge

186

Python bridge to ImageJ2/Fiji for macros, plugins (Bio-Formats, TrackMate, Analyze Particles), NumPy↔ImagePlus/ImgLib2 exchange, and ImageJ Ops. Automates Fiji headlessly from Python. Use scikit-image for pure Python without Fiji plugins; napari for visualization.

Dados e Análise#python#aiby jaechang-hits

scikit-image-processing

186

Python image processing for microscopy and bioimage analysis. Read/write images, filter (Gaussian, median, LoG), segment (thresholding, watershed, active contours), measure region properties, detect features. SciPy/NumPy ecosystem. Use OpenCV for real-time video; CellPose for DL cell segmentation; napari for visualization.

Dados e Análise#python#aiby jaechang-hits

plotly-interactive-plots

186

Interactive scientific visualization with Plotly. Two APIs: plotly.express (px) for one-liner DataFrame plots, plotly.graph_objects (go) for trace-level control. 40+ chart types with hover, zoom, pan, animation. Exports HTML or static PNG/SVG/PDF via kaleido. Use for volcano plots with gene hover, dose-response dashboards, expression heatmaps, 3D molecular views. Use seaborn for stats; matplotlib

Dados e Análise#pdf#aiby jaechang-hits

elife-figure-guide

186

eLife figure preparation: file formats (TIFF/EPS/PDF), striking image requirements (1800x900 px), figure supplement naming, and image screening policy treating selective enhancement as misconduct.

Documentos#pdf#aiby jaechang-hits

scientific-visualization

186

Guide for choosing and creating scientific visualizations for publications and talks. Covers chart-type selection by data structure, color theory for accessibility/print, figure composition, journal formatting (Nature, Cell, ACS), and common pitfalls. Consult when visualizing data or preparing submission figures.

Dados e Análise#aiby jaechang-hits

seaborn-statistical-plots

186

Statistical visualization on matplotlib with native pandas support. Auto aggregation, CIs, grouping for distributions (histplot, kdeplot), categorical (boxplot, violinplot), relational (scatterplot, lineplot), regression (regplot, lmplot), matrix (heatmap, clustermap), grids (pairplot, FacetGrid). Use for quick statistical summaries; matplotlib for fine control; plotly for interactive HTML.

Dados e Análise#aiby jaechang-hits

plotly-interactive-visualization

186

Interactive visualization with Plotly. 40+ chart types (scatter, line, heatmap, 3D, geographic) with hover, zoom, pan. Two APIs: Plotly Express (DataFrame) and Graph Objects (fine control). For static publication figures use matplotlib; for statistical grammar use seaborn.

Dados e Análise#ai#apiby jaechang-hits

seaborn-statistical-visualization

186

Statistical visualization on matplotlib + pandas. Distributions (histplot, kdeplot, violin, box), relational (scatter, line), categorical, regression, correlation heatmaps. Auto aggregation/CIs. Use plotly for interactive; matplotlib for low-level.

Dados e Análise#aiby jaechang-hits

single-cell-annotation

186

Best practices for single-cell RNA-seq cell type annotation including marker-based, reference-based, and automated classification approaches.

DevOps e Infra#aiby jaechang-hits

statistical-analysis

186

Guided statistical analysis: test choice, assumption checks, effect sizes, power, APA reporting. Pick tests, verify assumptions, or format results for publication. Covers frequentist (t-test, ANOVA, chi-square, regression, correlation, survival, count, reliability) and Bayesian. Use statsmodels or pymc-bayesian-modeling to fit.

Desenvolvimento#ai#testby jaechang-hits

statistical-significance-annotation

186

Guide for annotating statistical significance (p-value asterisks) on comparison plots. Covers standard notation (ns, *, **, ***, ****), matplotlib bracket+asterisk implementation, and use with seaborn box/violin/bar plots. Use when preparing publication-ready figures with significance markers.

Design e Frontend#aiby jaechang-hits

sciagent-skill-creator

186

Scaffold a new SciAgent-Skills entry. Picks pipeline/toolkit/database/guide template, creates skills/{category}/{name}/SKILL.md with valid frontmatter, appends the registry.yaml entry, runs validation. Enforces name uniqueness, kebab-case, description keyword rules, schema rules from CLAUDE.md. TRIGGER when user says (any language): "add a SciAgent skill", "add a skill for <X>", "create new skill"

Documentos#ai#wordby jaechang-hits

opentrons-integration

186

Opentrons Protocol API v2 for OT-2/Flex: Python protocols for pipetting, serial dilutions, PCR, plate replication; control thermocycler, heater-shaker, magnetic, temperature modules. Use pylabrobot for multi-vendor.

Automação#python#aiby jaechang-hits

pymc-bayesian-modeling

186

Bayesian modeling with PyMC 5: priors, likelihood, NUTS/ADVI sampling, diagnostics (R-hat, ESS), LOO/WAIC comparison, prediction. Hierarchical, logistic, GP variants; predictive checks.

DevOps e Infra#aiby jaechang-hits

scikit-survival-analysis

186

Time-to-event modeling with scikit-survival: Cox PH (elastic net), Random Survival Forests, Boosting, SVMs for censored data. C-index, Brier, time-dependent AUC; Kaplan-Meier, Nelson-Aalen, competing risks. Pipeline/GridSearchCV compatible. Use statsmodels for frequentist, pymc for Bayesian, lifelines for parametric.

Dados e Análise#aiby jaechang-hits

bwa-mem2-dna-aligner

186

Fast short-read DNA aligner for WGS/WES/ChIP-seq. 2× faster BWA-MEM successor; outputs SAM/BAM with read group headers for GATK. Primary plus supplementary records for chimeric reads. Use STAR for RNA-seq splice-aware alignment; Bowtie2 is a comparable alternative.

Marketing#aiby jaechang-hits

flowio-flow-cytometry

186

Parse/write FCS (Flow Cytometry) files v2.0-3.1. Events as NumPy, channel metadata, multi-dataset files, CSV/FCS export. Use FlowKit for gating/compensation.

Dados e Análise#aiby jaechang-hits

napari-image-viewer

186

Interactive viewer for microscopy. Displays 2D/3D/4D arrays as Image, Labels, Points, Shapes, Tracks layers; supports annotation, plugin analysis, headless screenshots. Core visualization for Python bioimage workflows. Use ImageJ/FIJI for macro processing; napari for Python-native interactive visualization and DL segmentation review.

Dados e Análise#python#aiby jaechang-hits

opencv-bioimage-analysis

186

Computer vision for bio-image preprocessing, feature detection, real-time microscopy. Color conversion, morphology, contour/blob detection, template matching, optical flow on fluorescence/brightfield. 10-100× faster than pure Python via C++. Use scikit-image for scientific morphometry/regionprops; OpenCV for real-time, video, classical feature extraction.

Escrita e Conteúdo#python#aiby jaechang-hits

trackpy-particle-tracking

186

Python library for single-particle tracking (SPT) in video microscopy via the Crocker-Grier algorithm. Locate particles (fluorescent spots, colloids, vesicles, cells) per frame, link into trajectories, filter short tracks, and compute MSD for diffusion analysis. 2D/3D with subpixel accuracy; reads TIF stacks, AVI, image series via pims. Use for quantitative SPT and diffusion coefficient extraction

Escrita e Conteúdo#python#aiby jaechang-hits

matplotlib-scientific-plotting

186

Low-level Python plotting for scientific figures: publication-quality line, scatter, bar, heatmap, contour, 3D; multi-panel layouts; fine control of every element. PNG/PDF/SVG export. Use seaborn for quick stats, plotly for interactive.

Design e Frontend#python#pdfby jaechang-hits

monarch-database

186

Monarch Initiative knowledge graph REST API for disease-gene-phenotype associations and cross-species orthology. MONDO disease-to-gene/phenotype, HP phenotype profiles, cross-species comparisons. Use for rare disease gene prioritization and phenotype-based candidate ranking. For GWAS use gwas-database; for clinical pathogenicity use clinvar-database.

Dados e Análise#ai#apiby jaechang-hits

bakta-genome-annotation

186

Annotate bacterial and archaeal genomes and plasmids with Bakta's Prodigal/HMM/diamond pipeline. Identifies CDS, ncRNA, tRNA, rRNA, tmRNA, sORFs, CRISPR arrays, oriC/oriV/oriT, and gaps against a curated UniRef-derived database. Produces NCBI-compatible GFF3, GenBank, EMBL, JSON, FASTA, TSV, and a circular genome plot. Use Prokka for legacy pipelines or non-bacterial kingdoms; PGAP for NCBI GenBan

DevOps e Infra#aiby jaechang-hits

biopython-sequence-analysis

186

Biopython sequence analysis: parse FASTA/FASTQ/GenBank/GFF (SeqIO), NCBI Entrez (esearch/efetch/elink), remote/local BLAST, pairwise/MSA alignment (PairwiseAligner, MUSCLE/ClustalW), phylogenetic trees (Phylo). Use for gene family studies, phylogenomics, comparative genomics, NCBI pipelines. For PCR/restriction/cloning use biopython-molecular-biology; for SAM/BAM use pysam.

DevOps e Infra#python#aiby jaechang-hits

cbioportal-database

186

Cancer genomics (TCGA et al.) via cBioPortal REST API. Retrieve somatic mutations, CNAs, expression, clinical data (survival/stage/treatment) across thousands of studies. Use for TMB, oncoprints, survival analysis. For population frequencies use gnomad-database; for drug-gene interactions use opentargets-database.

Dados e Análise#ai#apiby jaechang-hits

clinpgx-database

186

Query the ClinPGx (formerly PharmGKB) REST API plus the CPIC PostgREST companion API for pharmacogenomic clinical annotations, CPIC/DPWG dosing guidelines, gene-drug pairs, variant-drug associations, FDA/EMA drug labels, and PGx pathways. Two-host architecture: api.clinpgx.org for annotation records, api.cpicpgx.org for genotype→recommendation lookups. No auth. For germline pathogenicity use clinv

Dados e Análise#ai#apiby jaechang-hits

clinvar-database

186

Query NCBI ClinVar via E-utilities for variant clinical significance, pathogenicity, disease associations. Search by gene/rsID/condition/review status; returns ClinSig, submitter data, conditions, HGVS. For GWAS use gwas-database; for variant consequence prediction use Ensembl VEP.

Dados e Análise#aiby jaechang-hits

dbsnp-database

186

Query NCBI dbSNP for SNP records by rsID, gene, or region via E-utilities and Variation Services REST API. Retrieve alleles, MAF, variant class (SNV/indel/MNV), clinical links, cross-DB IDs (ClinVar, dbVar, 1000G). Free; 3 req/sec (10 with key). For clinical pathogenicity use clinvar-database; for population frequencies use gnomad-database.

Dados e Análise#ai#apiby jaechang-hits

ena-database

186

ENA REST API for sequences, reads, assemblies, and annotations. Portal API search, Browser API retrieval (XML/FASTA/EMBL), file reports for FASTQ/BAM URLs, taxonomy, cross-refs. For multi-DB Python use bioservices; for NCBI-only use pubmed-database or Biopython Entrez.

Dados e Análise#python#aiby jaechang-hits

gget-genomic-databases

186

Unified CLI/Python interface to 20+ genomic databases. Gene lookups (Ensembl search/info/seq), BLAST/BLAT, AlphaFold, Enrichr enrichment, OpenTargets disease/drug, CELLxGENE single-cell, cBioPortal/COSMIC cancer, ARCHS4 expression. Spans genomics, proteomics, disease. For batch/advanced BLAST use biopython; for multi-DB Python SDK use bioservices.

Dados e Análise#python#aiby jaechang-hits

gwas-database

186

NHGRI-EBI GWAS Catalog REST API for SNP-trait associations from published GWAS. Query studies, associations, variants, traits, genes, summary stats. Build PRS candidates, analyze pleiotropy, fetch stats for Manhattan plots. No auth.

Dados e Análise#ai#apiby jaechang-hits

kegg-database

186

KEGG REST API (academic only). Pathways, genes, compounds, enzymes, diseases, drugs via 7 ops (info/list/find/get/conv/link/ddi). ID conversion (NCBI/UniProt/PubChem). Use bioservices for multi-DB Python.

Dados e Análise#python#aiby jaechang-hits

prokka-genome-annotation

186

Annotate prokaryotic genomes (bacteria, archaea, viruses) via Prokka's BLAST/HMM pipeline. Identifies CDS, rRNA, tRNA, tmRNA, signal peptides against Pfam, TIGRFAMs, RefSeq. Outputs GFF3, GenBank, FASTA, TSV. Use PGAP for NCBI GenBank submission; Bakta for faster NCBI-compatible annotation.

DevOps e Infra#aiby jaechang-hits

roary-pangenome

186

Compute the bacterial pan-genome from Prokka/Bakta GFF3 annotations with Roary's CD-HIT + BLAST + MCL clustering pipeline. Builds gene presence/absence matrices, core/soft-core/shell/cloud partitions, multi-FASTA core gene alignments (with `-e`), and a pan-genome reference. Use Panaroo for higher-accuracy pan-genomes from highly fragmented assemblies, PIRATE for paralog-aware clustering, or PPanGG

DevOps e Infra#aiby jaechang-hits

Category alert

Published skills

lamindb-data-management

cellchat-cell-communication

smina-molecular-docking

torchdrug

unichem-database

zinc-database

brenda-database

pysam-genomic-files

samtools-bam-processing

pylabrobot

star-rna-seq-aligner

statsmodels-statistical-modeling

cellpose-cell-segmentation

histolab-wsi-processing

pyimagej-fiji-bridge

scikit-image-processing

plotly-interactive-plots

elife-figure-guide

scientific-visualization

seaborn-statistical-plots

plotly-interactive-visualization

seaborn-statistical-visualization

single-cell-annotation

statistical-analysis

statistical-significance-annotation

sciagent-skill-creator

opentrons-integration

pymc-bayesian-modeling

scikit-survival-analysis

bwa-mem2-dna-aligner

flowio-flow-cytometry

napari-image-viewer

opencv-bioimage-analysis

trackpy-particle-tracking

matplotlib-scientific-plotting

monarch-database

bakta-genome-annotation

biopython-sequence-analysis

cbioportal-database

clinpgx-database

clinvar-database

dbsnp-database

ena-database

gget-genomic-databases

gwas-database

kegg-database

prokka-genome-annotation

roary-pangenome

Get new Dados e Análise skills every Monday