Orchestration Log: When this skill is activated, append a log entry to outputs/orchestration_log.md:

### Skill Activation: Literature Engine
**Timestamp:** [current date/time]
**Actor:** AI Agent (literature-engine)
**Input:** [brief description of the search/synthesis request]
**Output:** [brief description of results — e.g., "47 papers found across 4 databases, deduplicated to 38"]

Literature Engine

Data Source Priority — STRICT

Academic APIs via scripts/academic_search.py — ALWAYS first
- Semantic Scholar (200M+ papers): citation graphs, TLDR, snowballing
- OpenAlex (474M+ works): broadest coverage, German publications, CC0
- CrossRef (150M+ works): DOI resolution, journal metadata
- arXiv (2.4M+ preprints): CS/AI/ML cutting-edge
Web search — ONLY for: VHB-JOURQUAL rankings, specific CFPs, conference info
Firecrawl — LAST RESORT for sources without APIs (AIS eLibrary, specific repositories)

Search Strategy

Step 1: Query Construction

For any research topic, construct 4-6 search queries:

Query Type	Purpose	Example (GenAI/Agents paper)
Core English	Main topic	"generative AI enterprise implementation"
Synonym English	Alternative terms	"large language models organizational adoption"
Narrow English	Specific aspect	"autonomous AI agents business process"
Adjacent English	Related field	"AI transformation strategy organizational change"
German	German publications	"generative KI Unternehmen Implementierung"
Theoretical	Theory-specific	"sociotechnical systems artificial intelligence"

Step 2: Execute Search

from scripts.academic_search import search_all, search_semantic_scholar, search_openalex, snowball, deduplicate_papers, papers_to_csv, papers_to_bibtex_file

# Round 1: Broad multi-API search
papers = search_all("generative AI enterprise implementation", 
                    max_results_per_source=20, year_from=2020)
papers += search_all("autonomous AI agents organizational", 
                     max_results_per_source=20, year_from=2020)
papers += search_all("LLM adoption business strategy", 
                     max_results_per_source=15, year_from=2022)

# Round 2: German sources via OpenAlex
papers += search_openalex("generative KI Implementierung Unternehmen", 
                          max_results=15, year_from=2020)

# Deduplicate
papers = deduplicate_papers(papers)
print(f"After dedup: {len(papers)} unique papers")

Step 3: Snowball Key Papers

# Sort by citations, snowball top 5
top = sorted(papers, key=lambda p: -(p.get("citation_count") or 0))[:5]
for p in top:
    if p.get("doi"):
        result = snowball(p["doi"], direction="both", limit=15)
        papers.extend(result.get("forward", []))
        papers.extend(result.get("backward", []))

# Also snowball seminal papers you know are relevant
seminal_dois = [
    # Add known seminal papers here, e.g.:
    # "10.2307/25148667",  # DeLone & McLean IS Success
]
for doi in seminal_dois:
    result = snowball(doi, direction="forward", limit=20)
    papers.extend(result.get("forward", []))

papers = deduplicate_papers(papers)

Step 4: Export & Save

# Save for further analysis
papers_to_csv(papers, "literature_base.csv")
papers_to_bibtex_file(papers, "references.bib")

Screening

Quick Screening (for building a paper's literature base)

Not a full SLR — just filter the most relevant papers:

Year range appropriate?
Published in a reputable venue?
Title/abstract clearly relevant to the research questions?
Methodology aligned with what we're looking for?

Formal SLR Screening

Use scripts/screening.py for systematic reviews:

from scripts.screening import screen_title_abstract, save_screening_results, generate_prisma_counts

results = screen_title_abstract(
    papers=papers,
    include_keywords=["generative AI", "LLM", "AI agent", "implementation", "adoption", "organization"],
    exclude_keywords=["medical imaging", "drug discovery", "protein folding"],
    min_year=2020,
    require_abstract=True
)
save_screening_results(results, "screening/")
print(generate_prisma_counts(results))

SLR Protocol Template (PRISMA-aligned)

## Systematic Literature Review Protocol

### Research Questions
- RQ1: [Primary question]
- RQ2: [Secondary question]

### Search Strategy
- Databases: Semantic Scholar, OpenAlex, CrossRef, arXiv [+ manual: AIS eLibrary, Google Scholar]
- Search terms: [list all queries]
- Date range: [YYYY] to [YYYY]
- Language: English [+ German if applicable]

### Inclusion Criteria
- IC1: [criterion]
- IC2: [criterion]
- IC3: [criterion]

### Exclusion Criteria
- EC1: [criterion]
- EC2: [criterion]

### Quality Assessment
- [criteria: VHB B+, peer-reviewed, minimum citation threshold for older papers]

### Data Extraction Categories
- [list what to extract from each paper]

### Synthesis Method
- [Webster & Watson concept matrix / thematic synthesis / meta-analysis / vote counting]

Organizing Literature: The Concept Matrix

The single most important tool for turning a pile of papers into a structured literature review (Webster & Watson, 2002).

How to Build One

| Source | [Concept A] | [Concept B] | [Concept C] | [Concept D] | Method | Context |
|--------|------------|------------|------------|------------|--------|---------|
| Author1 (Year) | ✓ | ✓ | | | Survey | Enterprise |
| Author2 (Year) | | ✓ | ✓ | | Case Study | SME |
| Author3 (Year) | ✓ | | | ✓ | DSR | Healthcare |

Example for the GenAI/Agents paper:

Source	Implementation Strategy	Org. Change	Technical Architecture	Governance & Ethics	Human-AI Interaction	Method	Context
[Author1]	✓	✓				Case Study	Large Corp
[Author2]			✓	✓		DSR	Enterprise
[Author3]	✓				✓	Survey	Multi-industry

What the Matrix Tells You

Columns with many ✓: Well-researched concepts → summarize, don't deep-dive
Columns with few ✓: Gaps → opportunity for contribution
Row patterns: Which studies are most comprehensive?
Method column: What methods dominate? What's underrepresented?
Context column: Which contexts are studied? Which are missing?

Narrative Synthesis

Turning the Concept Matrix into a Literature Review Section

For each concept (column), write one subsection:

[Concept Name] has received [considerable/growing/limited] attention in the literature.

[Dominant finding:] The majority of studies [find/suggest/demonstrate] that 
[main finding] (Author1, Year; Author2, Year; Author3, Year).

[Nuance/contradiction:] However, [Author4] (Year) argues that [alternative view], 
particularly in [context]. This is echoed by [Author5] (Year) who found [supporting 
evidence for the alternative].

[Your position:] These mixed findings suggest that [your interpretation]. 
In particular, [specific aspect] warrants further investigation, especially 
in the context of [your study's context].

Quality Indicators for Paper Selection

Journal Rankings (German academic context)

VHB-JOURQUAL 3: A+, A, B (minimum for good lit review), C (selective)
AIS Senior Scholars' Basket of Eight: EJIS, ISJ, ISR, JAIS, JIT, JMIS, JSIS, MISQ
ABS/AJG: 4*, 4, 3 (secondary reference)

For AI/GenAI Topics (fast-moving field)

Preprints acceptable for very recent developments (arXiv, SSRN)
Industry reports (McKinsey, Gartner, Deloitte) useful for motivation, not for theoretical claims
Conference papers from ICIS, ECIS, AAAI, NeurIPS useful for cutting-edge
Recency premium: For GenAI topics, 2023+ papers carry extra weight

literature-engine

How to add

Drop this on your repo README

Related skills

understand-dashboard

understand-chat

understand-domain

dev-browser

Get new Pesquisa e Web skills every Monday