Literature Review
Overview
Conduct systematic, comprehensive literature reviews following rigorous academic methodology. Search multiple literature databases, synthesize findings thematically, verify all citations for accuracy, and generate professional output documents in markdown and PDF formats.
This skill uses the parallel-web skill (parallel-cli search) as the primary web search tool for broad academic literature discovery, supplemented by specialized database access skills (gget, bioservices, datacommons-client). It provides specialized tools for citation verification, result aggregation, and document generation.
When to Use This Skill
Use this skill when:
- Conducting a systematic literature review for research or publication
- Synthesizing current knowledge on a specific topic across multiple sources
- Performing meta-analysis or scoping reviews
- Writing the literature review section of a research paper or thesis
- Investigating the state of the art in a research domain
- Identifying research gaps and future directions
- Requiring verified citations and professional formatting
Visual Enhancement with Scientific Schematics
⚠️ MANDATORY: Every literature review MUST include at least 1-2 AI-generated figures using the scientific-schematics skill.
This is not optional. Literature reviews without visual elements are incomplete. Before finalizing any document:
- Generate at minimum ONE schematic or diagram (e.g., PRISMA flow diagram for systematic reviews)
- Prefer 2-3 figures for comprehensive reviews (search strategy flowchart, thematic synthesis diagram, conceptual framework)
How to generate figures:
- Use the scientific-schematics skill to generate AI-powered publication-quality diagrams
- Simply describe your desired diagram in natural language
- Nano Banana Pro will automatically generate, review, and refine the schematic
How to generate schematics:
python scripts/generate_schematic.py "your diagram description" -o figures/output.png
The AI will automatically:
- Create publication-quality images with proper formatting
- Review and refine through multiple iterations
- Ensure accessibility (colorblind-friendly, high contrast)
- Save outputs in the figures/ directory
When to add schematics:
- PRISMA flow diagrams for systematic reviews
- Literature search strategy flowcharts
- Thematic synthesis diagrams
- Research gap visualization maps
- Citation network diagrams
- Conceptual framework illustrations
- Any complex concept that benefits from visualization
For detailed guidance on creating schematics, refer to the scientific-schematics skill documentation.
Core Workflow
Literature reviews follow a structured, multi-phase workflow:
Phase 1: Planning and Scoping
-
Define Research Question: Use PICO framework (Population, Intervention, Comparison, Outcome) for clinical/biomedical reviews
- Example: "What is the efficacy of CRISPR-Cas9 (I) for treating sickle cell disease (P) compared to standard care (C)?"
-
Establish Scope and Objectives:
- Define clear, specific research questions
- Determine review type (narrative, systematic, scoping, meta-analysis)
- Set boundaries (time period, geographic scope, study types)
-
Develop Search Strategy:
- Identify 2-4 main concepts from research question
- List synonyms, abbreviations, and related terms for each concept
- Plan Boolean operators (AND, OR, NOT) to combine terms
- Select minimum 3 complementary databases
- Use the parallel-web skill (
parallel-cli search) for initial scoping to quickly gauge the landscape before formal database searches
-
Set Inclusion/Exclusion Criteria:
- Date range (e.g., last 10 years: 2015-2024)
- Language (typically English, or specify multilingual)
- Publication types (peer-reviewed, preprints, reviews)
- Study designs (RCTs, observational, in vitro, etc.)
- Document all criteria clearly
Phase 2: Systematic Literature Search
-
Multi-Database Search:
Select databases appropriate for the domain. Always start with parallel-web for broad academic coverage, then supplement with domain-specific databases.
Web-Based Academic Search (parallel-web skill — START HERE):
- Use
parallel-cli searchwith academic domain filtering for broad scholarly coverage - Run two searches: academic-focused + general to catch all relevant sources
# Academic-focused search across scholarly sources parallel-cli search "your research topic" -q "keyword1" -q "keyword2" \ --json --max-results 10 --excerpt-max-chars-total 27000 \ --include-domains "scholar.google.com,arxiv.org,pubmed.ncbi.nlm.nih.gov,semanticscholar.org,biorxiv.org,medrxiv.org,ncbi.nlm.nih.gov,nature.com,science.org,ieee.org,acm.org,springer.com,wiley.com,cell.com,pnas.org,nih.gov" \ -o sources/litreview_<topic>-academic.json # General search for supplementary sources parallel-cli search "your research topic" -q "keyword1" -q "keyword2" \ --json --max-results 10 --excerpt-max-chars-total 27000 \ -o sources/litreview_<topic>-general.json- Use
parallel-cli extractto fetch full content from specific paper URLs or PDFs found in search results
parallel-cli extract "https://arxiv.org/abs/XXXX.XXXXX" --jsonBiomedical & Life Sciences:
- Use
ggetskill:gget search pubmed "search terms"for PubMed/PMC - Use
ggetskill:gget search biorxiv "search terms"for preprints - Use
bioservicesskill for ChEMBL, KEGG, UniProt, etc.
General Scientific Literature:
- Search arXiv via direct API (preprints in physics, math, CS, q-bio)
- Search Semantic Scholar via API (200M+ papers, cross-disciplinary)
- Use Google Scholar for comprehensive coverage (manual or careful scraping)
Specialized Databases:
- Use
gget alphafoldfor protein structures - Use
gget cosmicfor cancer genomics - Use
datacommons-clientfor demographic/statistical data - Use specialized databases as appropriate for the domain
- Use
-
Document Search Parameters:
## Search Strategy ### Database: PubMed - **Date searched**: 2024-10-25 - **Date range**: 2015-01-01 to 2024-10-25 - **Search string**:("CRISPR"[Title] OR "Cas9"[Title]) AND ("sickle cell"[MeSH] OR "SCD"[Title/Abstract]) AND 2015:2024[Publication Date]
- **Results**: 247 articlesRepeat for each database searched.
-
Export and Aggregate Results:
- Export results in JSON format from each database
- Combine all results into a single file
- Use
scripts/search_databases.pyfor post-processing:python search_databases.py combined_results.json \ --deduplicate \ --format markdown \ --output aggregated_results.md
Phase 3: Screening and Selection
-
Deduplication:
python search_databases.py results.json --deduplicate --output unique_results.json- Removes duplicates by DOI (primary) or title (fallback)
- Document number of duplicates removed
-
Title Screening:
- Review all titles against inclusion/exclusion criteria
- Exclude obviously irrelevant studies
- Document number excluded at this stage
-
Abstract Screening:
- Read abstracts of remaining studies
- Apply inclusion/exclusion criteria rigorously
- Document reasons for exclusion
-
Full-Text Screening:
- Obtain full texts of remaining studies
- Conduct detailed review against all criteria
- Document specific reasons for exclusion
- Record final number of included studies
-
Create PRISMA Flow Diagram:
Initial search: n = X ├─ After deduplication: n = Y ├─ After title screening: n = Z ├─ After abstract screening: n = A └─ Included in review: n = B