ralph-wiggum-bdd/scripts/ralph-wiggum-bdd.sh at main · marcindulak/ralph-wiggum-bdd · GitHub

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
#!/usr/bin/env bash

# Copyright 2026 https://github.com/marcindulak/ralph-wiggum-bdd
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#     http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

set -Eeuo pipefail

PROMPT='You are implementing a project using Behavior-Driven Development with Gherkin.

The instructions provided below have been approved by the CEO, so follow them.

Always read and follow CLAUDE.md.

## CRITICAL: ELN.md Read/Write Rules

ALWAYS READ ELN.md AT THE START OF THIS SESSION.
ELN.md is an append-only Electronic Lab Notebook.
You may ONLY write to ELN.md after successfully committing code.
You must ALWAYS use ELN.md as input to decision-making at any point.

## Discovery

1. Read REQUIREMENTS.md to understand what needs to be built.
2. Check if features/ directory exists.

## Stage 1: Generate Features (if features/ does not exist)

1. Analyze REQUIREMENTS.md and identify discrete testable requirements (both functional and non-functional ones).
2. Create features/ directory.
3. For each requirement group, create a numbered feature file (001.feature, 002.feature, etc.).
4. Each feature file must have:
   - @status-todo tag at the top
   - Feature title with number and brief description
   - Scenarios with concrete Given/When/Then steps (no vague language)

### Feature Consistency Check (BLOCKS PROGRESS)

After generating features, verify:
- REQUIREMENTS.md ↔ Features: Do features cover all requirements? Do features contradict REQUIREMENTS.md?
- Contradiction Detection: No two scenarios have identical (Given, When) but conflicting Then outcomes.
- Overlapping Triggers: No identical When clauses with different Then clauses unless intentional.
- Testability: Every Then clause is concrete and verifiable, not vague like "works well".

If issues found, stop and ask human to clarify or update features and/or REQUIREMENTS.md.

5. Commit the feature files using this format:

   Generate features from REQUIREMENTS.md

   Why:
   [1-2 sentences explaining what features were generated]

   Caveats:
   None

6. After committing, append an entry to ELN.md with observations and learnings. Include also negative learnings, like details of failed experiments. Add the new entry AT THE END OF THE FILE. Use the same format as Stage 2 (see Complete section).

7. MANDATORY STOP: Exit immediately after creating feature files. Do NOT proceed to Stage 2. Do NOT implement any code. The next iteration will handle implementation.

## Stage 2-6: Implement Features (if features/ exists)

### Feature Consistency Check (MANDATORY GATE - BLOCKS PROGRESS)

Before selecting a feature to implement, ALWAYS verify:
- REQUIREMENTS.md ↔ Features: Do features cover all requirements? Do features contradict REQUIREMENTS.md?
- Contradiction Detection: No two scenarios have identical (Given, When) but conflicting Then outcomes.
- Overlapping Triggers: No identical When clauses with different Then clauses unless intentional.
- Testability: Every Then clause is concrete and verifiable, not vague like "works well".

If issues found, stop and ask human to clarify or update features and/or REQUIREMENTS.md.

### Check for Uncommitted Work

Before finding the next feature, check git status:
1. If there are uncommitted changes AND tests pass, commit them immediately using the commit format.
2. If there are uncommitted changes AND tests fail, fix the failures first.
3. Only proceed to finding next feature after all changes are committed.

### Find Next Feature

1. If a feature has @status-active, resume that feature ONLY.
2. If multiple @status-active exist, pick the one with uncommitted changes or lowest number, set others to @status-todo.
3. If no @status-active, analyze @status-todo features and select the SINGLE highest-priority feature based on:
   - Dependencies (implement prerequisites first)
   - Complexity (simpler features first when no dependencies)
   - Numerical order (as fallback)
4. Change selected feature tag to @status-active.
5. CRITICAL: Work on ONLY THIS ONE FEATURE for this entire iteration. Do not implement other features.

### Implement

1. Modify ONLY the @status-active feature.
2. Create step definitions in features/steps/ for undefined steps in this feature.
3. Reuse existing step definitions when the step text matches.
4. Implement code in src/ to make the scenarios of the @status-active feature pass.
   MANDATORY: ALL implementation code must be placed in src/ directory. Never place code in the project root or other directories.
5. MANDATORY: Run ALL tests using the test framework. Do not skip this step.
   - Verify that ALL scenarios across ALL features pass.
   - If other features fail after your changes, debug and fix to make all tests pass.
6. Remove any temporary debugging files before committing. Never commit debug or temporary files.

### Sync Verification

Before committing, run tests and verify bidirectional consistency:
- REQUIREMENTS.md ↔ Features: Do features cover all requirements? Do features contradict requirements?
- Features ↔ Steps: Do step definitions correctly implement the Gherkin steps?
- Steps ↔ Code: Does the implementation match what steps expect?
- Code ↔ All Features: Run ALL scenarios to verify they pass. All tests must pass before proceeding.

IMPORTANT - Bidirectional Sync:
When inconsistencies are found, DO NOT assume which direction to sync.
Ask the user: "Should I update [X] to match [Y], or update [Y] to match [X]?"
Examples:
- Code changed but tests fail: Ask whether to fix code OR update features/REQUIREMENTS
- REQUIREMENTS changed but features mismatch: Ask whether to update features OR revert REQUIREMENTS

### Implementation Consistency Check (BLOCKS PROGRESS - MANDATORY)

This is a hard requirement. Do not proceed without it.
Run the complete test suite. All scenarios across all features must pass.
If any scenario fails, debug and fix before proceeding.
Do not mark @status-done unless the tests verify all scenarios pass.

### Complete

1. ONLY after the test suite runs and ALL scenarios pass: change the @status-active feature tag to @status-done.
   If other features now pass as a result of your changes, mark those as @status-done too.

2. Commit all related file changes together (REQUIREMENTS.md, feature, steps, src, ...) using EXACTLY this format:

   Implement Feature NNN: Brief Title

   Why:
   [1-2 sentences explaining what this feature does, not that tests pass]

   Caveats:
   [Breaking changes, migration paths, or impacts. If none, write "None"]

   Example:
   ```
   Implement Feature 001: Temperature Conversion

   Why:
   Provides bidirectional temperature conversion between Celsius and Fahrenheit.

   Caveats:
   None
   ```

3. MANDATORY: After committing, you MUST append an entry to ELN.md (Electronic Lab Notebook) with observations and learnings.
   Do not skip this step. Add the new entry AT THE END OF THE FILE (after all existing entries).
   The entry must follow this format EXACTLY:

   ```
   ## Start of ENTRY AAA

   **DATE:** YYYY-MM-DD HH:MM:SS
   **TITLE:** Implement Feature NNN: Brief Title
   **COMMIT:** <commit-sha>

   ### DECISIONS

   [Chosen approach and why. Document ALL design decisions:
   code, infrastructure, tooling (e.g., test framework, package manager),
   configuration (e.g., database engine), and deployment. Each decision on its own line.
   CRITICAL: For any decision that relates to or builds on existing ELN entries, you MUST explicitly reference the ENTRY number.
   Example: "Following ENTRY 001 decision to use Python, chose Framework X because..."
   or "Reconsidered ENTRY 003 decision about Y, now using Z instead because..."]

   ### ALTERNATIVES

   [Each alternative on its own line with the reason for rejection.
   If none were considered, explain why the decision was constrained.]

   ### OBSERVATIONS

   [Things noticed that cannot yet be classified as a decision or alternative.
   Unexpected behaviors, anomalies, or open questions that may become relevant
   in future iterations. Do NOT put status statements like "all tests pass" here.
   Do NOT repeat descriptions of DECISIONS or ALTERNATIVES here.
   If there is nothing to record, write "None".]

   ## End of ENTRY AAA
   ```

   Example:
   ```
   ## Start of ENTRY 002

   **DATE:** 2024-01-19 14:23:45
   **TITLE:** Implement Feature 001: Temperature Conversion
   **COMMIT:** a1b2c3d4e5f6g7h8i9j0k1l2m3n4o5p6q7r8s9t

   ### DECISIONS

   - Used simple arithmetic formulas for conversion — fast, maintainable, no dependencies.
   - Used Python 3.12 slim container image — smaller than full image, sufficient for this project.

   ### ALTERNATIVES

   - Considered lookup tables (rejected: inflexible for arbitrary input values).
   - Considered polynomial approximation (rejected: unnecessary complexity for exact formulas).
   - Considered Alpine-based container (rejected: musl libc compatibility issues with some Python packages).

   ### OBSERVATIONS

   - The conversion formula loses precision for values near float64 limits — may need investigation if high-precision use cases arise.

   ## End of ENTRY 002
   ```

   If ELN.md does not exist, create it with a header explaining it is an append-only log.
   IMPORTANT: ELN.md is NOT committed - it remains a local non-committed file.

4. STOP HERE. Do NOT proceed to the next feature.
   Tell the user: "Feature NNN is complete and committed. Type /exit to end this iteration and start the next feature with fresh context, or tell me to continue if you want to proceed in this session." (replace NNN with the actual just completed feature number, e.g., 001)
   Wait for user to exit, or explicit user decision to continue.

## MANDATORY: End of Iteration

THIS ITERATION IS COMPLETE. You must not:
- Select or work on another feature
- Implement additional features beyond the @status-active feature you just completed
- Commit additional features
- Continue beyond this point

If there are uncommitted changes for other features marked @status-done as side effects, do not commit them yet. Wait for the next iteration.

The script will start a new iteration when the user returns.

## Rules

- One feature per iteration. STOP after completing or when blocked. Do NOT continue to next feature.
- All scenarios and tests must pass before any commit. This is non-negotiable. Run the test suite and verify.
- After ANY commit (even one requested explicitly by the user), you MUST append an entry to ELN.md. No exceptions.
- ELN.md entries must document ALL design decisions, not only code-level choices: include infrastructure, tooling, configuration, and deployment decisions.
- ELN.md entries must include ALTERNATIVES section: document rejected alternatives if any were considered, or explain why the decision was constrained.
- Never modify REQUIREMENTS.md without human approval.
- Stop and ask for clarification if requirements are ambiguous or contradictory.
- After committing a feature, STOP and tell user to type /exit for fresh context.
- If you have not run tests and verified all scenarios pass, you cannot mark @status-done or commit.
'


usage() {
    echo ""
    echo "Usage: $0 <options>"
    echo ""
    echo "Options:"
    echo "  --help             Show this help message"
    echo "  --interactive      Use interactive Claude mode (human controls when to stop)"
    echo "  --iterations N     Maximum number of agent iterations (non-interactive mode)"
    echo "  --no-local-context Do not use the local electronic notebook file (ELN.md) in the agent context"
    echo "  --prompt           Custom prompt string, e.g., \"You're absolutely right!\", or \"\$(cat prompt.md)\""
    exit 1
}

MAX_ITERATIONS=""
INTERACTIVE_MODE=false
LOCAL_CONTEXT=true

while [[ $# -gt 0 ]]; do
    case "$1" in
        --help)
            usage
            ;;
        --interactive)
            INTERACTIVE_MODE=true
            shift
            ;;
        --iterations)
            if [[ -z "$2" ]] || ! [[ "$2" =~ ^[0-9]+$ ]]; then
                echo "Error: --iterations requires a positive integer"
                exit 1
            fi
            MAX_ITERATIONS="$2"
            shift 2
            ;;
        --no-local-context)
            LOCAL_CONTEXT=false
            shift
            ;;
        --prompt)
            if [[ -z "$2" ]]; then
                echo "Error: --prompt requires a string" >&2
                exit 1
            fi
            PROMPT="$2"
            shift 2
            ;;
        *)
            echo "Error: Unknown option: $1"
            usage
            ;;
    esac
done

if [[ "${INTERACTIVE_MODE}" == "true" ]] && [[ -n "${MAX_ITERATIONS}" ]]; then
    echo "Error: --iterations and --interactive are mutually exclusive"
    echo "In interactive mode, the human controls when to stop"
    usage
fi

if [[ "${INTERACTIVE_MODE}" == "false" ]] && [[ -z "${MAX_ITERATIONS}" ]]; then
    echo "Error: --iterations is required in non-interactive mode"
    usage
fi

if [[ "${INTERACTIVE_MODE}" == "true" ]]; then
    MAX_ITERATIONS=1
    PROMPT_FILE="/tmp/ralph-wiggum-bdd-prompt-$$.txt"
fi

if [[ "${LOCAL_CONTEXT}" == "false" ]]; then
    PROMPT="${PROMPT//ALWAYS READ ELN.md AT THE START/NEVER READ ELN.md AT THE START}"
    PROMPT="${PROMPT//You must ALWAYS use ELN.md as input to decision-making/You must NEVER use ELN.md as input to decision-making}"
fi

round10() {
    local n=$1
    echo $(( (n + 5) / 10 * 10 ))
}

print_iteration_time() {
    local iteration_start=$1
    local i=$2
    if [[ -n "${iteration_start}" ]]; then
        local elapsed=$((SECONDS - iteration_start))
        echo "" >&2
        echo "--- End of iteration ${i} --- (${elapsed}s)" >&2
    fi
}

on_signal() {
    local signal=$1
    local iteration_start=$2
    local i=$3
    local prompt_file=$4

    print_iteration_time "$iteration_start" "$i"
    [[ -n "${prompt_file}" ]] && /bin/rm -f "${prompt_file}"

    local signal_num
    case "$signal" in
        INT) signal_num=2 ;;
        TERM) signal_num=15 ;;
    esac
    exit $((128 + signal_num))
}

trap '[[ -n "${PROMPT_FILE:-}" ]] && /bin/rm -f "${PROMPT_FILE}"' EXIT
trap 'on_signal INT "${iteration_start:-}" "$i" "${PROMPT_FILE:-}"' INT
trap 'on_signal TERM "${iteration_start:-}" "$i" "${PROMPT_FILE:-}"' TERM

echo "ralph-wiggum-bdd: Starting with max ${MAX_ITERATIONS} iterations"
echo "----------------------------------------"

for ((i = 1; i <= MAX_ITERATIONS; i++)); do
    echo ""
    echo "Iteration ${i}/${MAX_ITERATIONS}"
    echo "----------------------------------------"

    iteration_start=$SECONDS

    echo "DEBUG: About to invoke claude..." >&2
    echo -n "DEBUG: Approximate PROMPT length: $(round10 ${#PROMPT}) characters" >&2
    echo -n ", $(round10 $(set -- $PROMPT && echo $#)) words" >&2
    echo ", $(round10 $(( ($(set -- $PROMPT && echo $#)*3)/2 ))) tokens" >&2
    echo "DEBUG: Interactive mode: ${INTERACTIVE_MODE}" >&2

    if [[ "${INTERACTIVE_MODE}" == "true" ]]; then
        printf '%s' "${PROMPT}" > "${PROMPT_FILE}"
        echo "" >&2
        echo "To start execution, copy-paste @${PROMPT_FILE} in Claude" >&2
        echo "" >&2
        claude
    else
        echo "${PROMPT}" | claude --print --dangerously-skip-permissions || true
    fi

    print_iteration_time "$iteration_start" "$i"
done