This repository contains a lightweight environment for training and evaluating creative-writing models using reinforcement learning. It includes components for generating stories, evaluating them using rubric-based scoring, and configuring training runs.
The environment is designed for experimenting with RLHF pipelines for story generation.
A model produces a story based on a prompt, and a judge evaluates the output according to structured rubrics.
The judge returns a numerical reward along with rubric-specific scores.
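The shape of that interaction, in a minimal runnable sketch (the `generate_story` and `judge_story` helpers here are hypothetical placeholders, not the actual entry points):

```python
# Minimal sketch of one generate-then-judge episode. Both helpers are
# placeholders standing in for the real model and judge calls.

def generate_story(prompt: str) -> str:
    """Placeholder for the model call that turns a prompt into a story."""
    return f"Once upon a time... ({prompt})"

def judge_story(story: str, rubrics: list[str]) -> dict:
    """Placeholder judge: returns a scalar reward plus per-rubric scores."""
    rubric_scores = {name: 1.0 for name in rubrics}
    reward = sum(rubric_scores.values()) / len(rubric_scores)
    return {"reward": reward, "rubric_scores": rubric_scores}

task = {"prompt": "A lighthouse keeper finds a message in a bottle.",
        "rubrics": ["pacing", "imagery"]}
result = judge_story(generate_story(task["prompt"]), task["rubrics"])
print(result["reward"], result["rubric_scores"])  # 1.0 {'pacing': 1.0, 'imagery': 1.0}
```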
The `writing_bench.py` script runs the main writing-generation loop; a sketch of the loop's shape follows the list below.
It is responsible for:
- Loading tasks from JSON files
- Sampling prompts and running generation episodes
- Handling batch size, sequence length, rollouts, and sampling settings
- Returning outputs suitable for RL training or evaluation workflows
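A rough sketch of how such a loop might be wired up, assuming tasks are stored as a JSON list and using hypothetical `batch_size`/`rollouts` parameters (the real implementation in `writing_bench.py` may differ):

```python
# Illustrative generation loop: load tasks, sample a batch of prompts,
# and run several rollouts per prompt. Not the actual writing_bench.py code.
import json
import random

def generate_story(prompt: str) -> str:
    return f"story for: {prompt}"  # placeholder model call

def run_generation(task_path: str, batch_size: int = 4, rollouts: int = 2) -> list[dict]:
    with open(task_path) as f:
        tasks = json.load(f)                       # tasks assumed to be a JSON list
    batch = random.sample(tasks, k=min(batch_size, len(tasks)))
    episodes = []
    for task in batch:
        for _ in range(rollouts):                  # multiple rollouts per prompt
            episodes.append({"task": task, "story": generate_story(task["prompt"])})
    return episodes  # outputs suitable for downstream RL training or evaluation

episodes = run_generation("generated_tasks8_train.json")
```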
The `writing_judge.py` script implements the rubric-based evaluation system; a sketch of the reward computation follows the list below.
The judge performs the following:
- Reads task metadata and rubric definitions
- Checks objective requirements (such as word count, POV, tense, and genre)
- Applies subjective scoring using a judge model
- Produces a final numeric reward and a detailed rubric breakdown
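One reasonable way these pieces could combine, sketched with assumed field names and an assumed gating rule (failing an objective check zeroes the reward; the actual weighting in `writing_judge.py` may differ):

```python
# Hedged sketch of reward computation: objective checks gate a subjective
# rubric average. Field names and the gating rule are assumptions.

def check_objective(story: str, task: dict) -> float:
    """Hard requirement check, e.g. minimum word count: pass=1.0, fail=0.0."""
    min_words = task.get("min_word_count", 0)
    return 1.0 if len(story.split()) >= min_words else 0.0

def score_subjective(story: str, rubrics: list[str]) -> dict:
    """Placeholder for the judge-model call scoring each rubric in [0, 1]."""
    return {name: 0.5 for name in rubrics}

def final_reward(story: str, task: dict) -> float:
    objective = check_objective(story, task)
    rubric_scores = score_subjective(story, task["rubrics"])
    subjective = sum(rubric_scores.values()) / len(rubric_scores)
    return objective * subjective  # a failed hard requirement zeroes the reward

task = {"min_word_count": 5, "rubrics": ["pacing", "imagery"]}
print(final_reward("The fog rolled in over the quiet harbor town.", task))  # 0.5
```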
This repository includes two task sets:
- `generated_tasks8_train.json`
- `generated_tasks8_eval.json`
Each task contains:
- A story prompt
- Required stylistic or structural attributes
- Rubric definitions used by the judge for scoring
These files are used during training and evaluation, respectively.
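For illustration, a single task entry might look roughly like this; the field names are assumptions, not the actual schema of the JSON files:

```python
# Illustrative shape of one task; consult the JSON files for the real schema.
example_task = {
    "prompt": "Write a short story about a lighthouse keeper who finds a message in a bottle.",
    "attributes": {
        "pov": "first person",   # required point of view
        "tense": "past",         # required tense
        "genre": "mystery",      # required genre
        "min_word_count": 300,   # objective length requirement
    },
    "rubrics": [
        {"name": "pacing", "description": "Does the story build tension steadily?"},
        {"name": "imagery", "description": "Are descriptions vivid and concrete?"},
    ],
}
```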
The `writing_configs.toml` file defines configuration settings for:
- Model selection
- Checkpoint intervals
- Batch sizes and rollout parameters
- Sequence lengths and sampling settings
- Environment parameters (such as which task file to load)
- Optimizer and LoRA settings
This file serves as the central configuration point for training.
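Since the file is TOML, it can be read with Python's standard-library parser; the section and key names below are illustrative guesses, not the file's actual layout:

```python
# Sketch of loading writing_configs.toml (requires Python 3.11+ for tomllib).
# The section/key names are assumptions; check the file for the real ones.
import tomllib

with open("writing_configs.toml", "rb") as f:
    config = tomllib.load(f)

model_name = config.get("model", {}).get("name")            # model selection
batch_size = config.get("training", {}).get("batch_size")   # rollout batching
task_file = config.get("environment", {}).get("task_file")  # which task JSON to load
```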
Run the generation and judging processes:
python writing_bench.py --config writing_configs.toml
python writing_judge.py --tasks generated_tasks8_eval.json