ai-science-genomic-llms

Name: ai-science-genomic-llms
Rating: 5 (3 reviews)
Author: Pavel-Kravchenko

DevOps e Infra

Author: Pavel-Kravchenko

Genomic Foundation Models: Nucleotide Transformers, HyenaDNA, and Evo with NumPy.

3stars

Updated 3 months ago

View on GitHub ↗

How to add

/plugin marketplace add Pavel-Kravchenko/Bioinformatics

The exact command may vary by repository. Check the README on GitHub.

For the skill author

Drop this on your repo README

Shows your skill is listed on Skillteca, generates a backlink and trackable traffic.

[![Listada na Skillteca](https://www.skillteca.com.br/api/badge/ai-science-genomic-llms/svg)](https://www.skillteca.com.br/skills/ai-science-genomic-llms?utm_source=badge&utm_medium=readme&utm_campaign=badge)

#llm #ai

Related skills

See all in DevOps e Infra →

internal-comms

153.1k

Resources to assist in writing various internal communications, adhering to company-preferred formats. Claude should utilize this skill for status reports, leadership updates, newsletters, FAQs, and other internal documents.

DevOps e Infraby anthropics

babysit

83.4k

Monitors a pull request or review cycle until it is ready to merge. This skill is used to track PR comments, reviews, and CI status until all actionable issues are resolved.

DevOps e Infra#aiby thedotmack

do

83.4k

Execute a phased implementation plan using subagents. Use when asked to execute, run, or carry out a plan — especially one created by make-plan.

DevOps e Infra#aiby thedotmack

smart-explore

83.4k

Token-optimized structural code search using tree-sitter AST parsing. Use this instead of reading full files when you need to understand code structure, find functions, or explore a codebase efficiently.

DevOps e Infra#aiby thedotmack

Category alert

Get new DevOps e Infra skills every Monday

One short email with only the new DevOps e Infra skills. 4 minutes of reading, no spam, unsubscribe with one click.

You confirm your email on the first send. No spam. Unsubscribe with one click.

Genomic Foundation Models: Nucleotide Transformers, HyenaDNA, and Evo

Tokenization Strategies

Character-level (A,C,G,T,N): highest resolution, long sequences
k-mer tokens (e.g., k=6): compressed representation; k=6 → 4096-token vocab, k=8 → 65,536 — use k≤6 for explicit k-mer tokenization
BPE/subword: data-driven token units (used in some genomic LMs)
Nucleotide Transformer: 6-mer tokens, stride=1, 4096-vocab; ~L/6 tokens per sequence — loses single-nucleotide re

[Description truncada. Veja o README completo no GitHub.]

ShareX LinkedIn

Comments · No comments

No comments yet. Be the first.