diff --git a/deliverables/proposals/index.md b/deliverables/proposals/index.md index 4364eb5..6e8b758 100644 --- a/deliverables/proposals/index.md +++ b/deliverables/proposals/index.md @@ -49,4 +49,9 @@ Summary: Proposal for the Foreman Probe project to create model probe tasks that ### Crimson Leaf Holdings -- Task 998dcdfe-4851-4de2-8cb6-29075f993366 Date: 2026-04-29 Status: AWAITING DAVID'S APPROVAL -Summary: Proposal for the Foreman Probe project to model probe tasks created by the Foreman to benchmark and evaluate LLM capabilities. This fills the gap in internal performance evaluation by providing a standardized testbed, differing from the general Incubation proposal by focusing specifically on technical validation metrics for the Foreman system. \ No newline at end of file +Summary: Proposal for the Foreman Probe project to model probe tasks created by the Foreman to benchmark and evaluate LLM capabilities. This fills the gap in internal performance evaluation by providing a standardized testbed, differing from the general Incubation proposal by focusing specifically on technical validation metrics for the Foreman system. + +### Crimson Leaf Holdings -- Task 16c4e89f-fd1a-4741-a0d9-0823c12d28d0 +Date: 2026-04-29 +Status: AWAITING DAVID'S APPROVAL +Summary: Proposal for the Foreman Probe project to develop specialized model probe tasks that benchmark and evaluate internal LLM capabilities. It addresses the lack of high-fidelity performance metrics by creating a standardized testing framework, distinguishing itself from the broader Incubation project by focusing on technical validation of the Foreman's reasoning engines. \ No newline at end of file