Analysis by Yasir Atalan, Benjamin Jensen, and Ian Reynolds
Published April 17, 2025
All eight large language models (LLMs) recommend more escalatory responses for the United States, United Kingdom, and France, while recommending escalation less often for China and Russia.
To safeguard decisionmaking, governments and agencies must invest in comprehensive evaluation frameworks and institute routine audits of AI models. Adopting tools like Futures Lab's CFPD-Benchmark can help identify and correct these biases before deployment, ensuring that AI supports strategic objectives while minimizing unintended risks.
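For illustration, a routine audit of this kind might compare how escalatory a model's recommendations are depending on which country is the acting state. The sketch below assumes recommendations have already been collected and hand-coded on a simple escalation scale; the data, scoring scale, and review threshold are hypothetical placeholders and are not drawn from the CFPD-Benchmark methodology itself.

# Minimal audit sketch: compare mean escalation scores by actor country.
# All data, the 1-5 escalation scale, and the review threshold are illustrative.
from statistics import mean
from collections import defaultdict

# (model, actor_country, escalation_score) triples from coded scenario runs.
coded_runs = [
    ("model_a", "United States", 4), ("model_a", "China", 2),
    ("model_a", "United Kingdom", 4), ("model_a", "Russia", 2),
    ("model_b", "United States", 5), ("model_b", "China", 3),
    ("model_b", "France", 4), ("model_b", "Russia", 2),
]

def escalation_gap(runs, group_a, group_b):
    """Per model: mean escalation score for group_a actors minus group_b actors."""
    by_model = defaultdict(lambda: {"a": [], "b": []})
    for model, actor, score in runs:
        if actor in group_a:
            by_model[model]["a"].append(score)
        elif actor in group_b:
            by_model[model]["b"].append(score)
    return {m: mean(v["a"]) - mean(v["b"]) for m, v in by_model.items()}

gaps = escalation_gap(
    coded_runs,
    group_a={"United States", "United Kingdom", "France"},
    group_b={"China", "Russia"},
)
for model, gap in gaps.items():
    flag = "REVIEW" if abs(gap) > 1.0 else "ok"  # illustrative audit threshold
    print(f"{model}: escalation gap = {gap:+.2f} ({flag})")

A positive gap indicates the model recommends more escalatory options for the first group of countries than for the second; flagged models would then be examined more closely before deployment.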
CSIS Charts is produced by the Center for Strategic and International Studies (CSIS), a private, tax-exempt institution focusing on international public policy issues. Its research is nonpartisan and nonproprietary. CSIS does not take specific policy positions. Accordingly, all views, positions, and conclusions expressed in this publication should be understood to be solely those of the author(s).
© 2025 by the Center for Strategic and International Studies. All rights reserved.