Top-K - SkyDiscover

Overview

Top-K is a straightforward search algorithm that always selects the best program as the parent and uses the next best programs as context. It’s the simplest effective baseline in SkyDiscover.

Greedy Selection

Always picks the highest-scoring program to evolve

Elite Context

Provides top programs as examples in the prompt

No Stagnation

Continues refining even if improvement is slow

Minimal Overhead

Fast and simple with no complex bookkeeping

How It Works

Sampling Strategy

On each iteration, Top-K:

Parent Selection: Choose the program with the highest combined_score
Context Selection: Select the next K programs (ranks 2 to K+1) as context
Prompt Generation: Include context programs as examples of good solutions
Evolution: Generate a mutation of the best program

If only one program exists, it’s used as both parent and context.

Pure Exploitation

Top-K is a pure exploitation strategy:

No exploration of diverse solutions
No randomness in selection
Focuses entirely on refining the current best

This can be very effective for problems where iterative refinement is the key to success.

Configuration

Basic Usage

skydiscover-run initial_program.py evaluator.py \
  --search topk \
  --iterations 50

Python API

from skydiscover import run_discovery

result = run_discovery(
    initial_program="initial.py",
    evaluator="eval.py",
    search="topk",
    iterations=50,
)

print(f"Best score: {result.best_score}")

Configuration File

search:
  type: topk
  database:
    # No special configuration needed for Top-K
    # Standard database settings apply
    db_path: "outputs/topk"
  
  # Number of context programs to include in prompts
  num_context_programs: 4

Configuration Options

num_context_programs

int

default:"4"

Number of top programs (after the best) to include as context

Top-K uses the standard database configuration options:

db_path

string

Directory to save programs and checkpoints

log_prompts

bool

default:"false"

Whether to save prompts and responses

When to Use Top-K

Best For

Quick experiments and baselines
Problems where greedy refinement works well
Short discovery runs (< 50 iterations)
When you want simple, predictable behavior
As a starting point before trying more complex algorithms

Avoid When

Problems with many local optima requiring exploration
Need for diverse solution approaches
Risk of getting stuck in local optima

Comparison with Other Algorithms

Algorithm	Exploration	Exploitation	Complexity
Top-K	None	High	Low
Best-of-N	None	High	Low
Beam Search	Medium	Medium	Medium
AdaEvolve	Adaptive	Adaptive	High

Example

Simple Optimization

# evaluator.py
def evaluate(program_path):
    # Run and score the program
    result = run_program(program_path)
    return {"combined_score": result.accuracy}

# initial_program.py
def solve(data):
    # Simple baseline
    return data

# Run Top-K for 30 iterations
skydiscover-run initial_program.py evaluator.py \
  --search topk \
  --iterations 30 \
  --model gpt-5

Output Structure

outputs/topk/
├── programs/
│   ├── program_abc123.json    # Best program
│   ├── program_def456.json    # Second best
│   └── ...
├── metadata.json              # Best program ID, iteration count
└── checkpoint_30/             # Checkpoint at iteration 30

Tips for Best Results

Start Simple

Top-K is an excellent starting point. Run it first to establish a baseline before trying more complex algorithms.

Monitor for Stagnation

If the score stops improving after several iterations, consider switching to an algorithm with exploration (like AdaEvolve or Beam Search).

Use Good Context

Increase num_context_programs (e.g., to 8) to give the LLM more examples of successful solutions.

Iterate Quickly

Top-K’s simplicity makes it fast. Use it for rapid iteration during development.

Variants

You can easily modify Top-K behavior:

Random Top-K

Sample randomly from top-K instead of always using #1:

from skydiscover.search.topk import TopKDatabase
import random

class RandomTopKDatabase(TopKDatabase):
    def sample(self, num_context_programs=4, **kwargs):
        top_programs = self.get_top_programs(num_context_programs + 1)
        parent = random.choice(top_programs[:3])  # Pick from top 3
        context = [p for p in top_programs if p.id != parent.id][:num_context_programs]
        return parent, context

Top-K with Temperature

Weight selection by score:

import numpy as np

class TemperatureTopKDatabase(TopKDatabase):
    def __init__(self, name, config):
        super().__init__(name, config)
        self.temperature = 0.5
    
    def sample(self, num_context_programs=4, **kwargs):
        top_programs = self.get_top_programs(10)
        scores = [p.metrics.get('combined_score', 0) for p in top_programs]
        probs = np.exp(np.array(scores) / self.temperature)
        probs /= probs.sum()
        parent = np.random.choice(top_programs, p=probs)
        context = [p for p in top_programs if p.id != parent.id][:num_context_programs]
        return parent, context

Best-of-N - Similar greedy approach with N attempts per parent
Beam Search - Maintains multiple candidates instead of just one
AdaEvolve - Adaptive version with exploration

​Overview

Greedy Selection

Elite Context

No Stagnation

Minimal Overhead

​How It Works

​Sampling Strategy

​Pure Exploitation

​Configuration

​Basic Usage

​Python API

​Configuration File

​Configuration Options

​When to Use Top-K

​Comparison with Other Algorithms

​Example

​Simple Optimization

​Output Structure

​Tips for Best Results

​Variants

​Random Top-K

​Top-K with Temperature

​Related Algorithms

Overview

How It Works

Sampling Strategy

Pure Exploitation

Configuration

Basic Usage

Python API

Configuration File

Configuration Options

When to Use Top-K

Comparison with Other Algorithms

Example

Simple Optimization

Output Structure

Tips for Best Results

Variants

Random Top-K

Top-K with Temperature

Related Algorithms