Publication Cover
International Interactions
Empirical and Theoretical Research in International Relations
Volume 43, 2017 - Issue 6
976
Views
8
CrossRef citations to date
0
Altmetric
Articles

Measuring Military Effectiveness: Calculating Casualty Loss-Exchange Ratios for Multilateral Wars, 1816–1990

&
 

ABSTRACT

In this article, we introduce and utilize a new dataset that provides battle- and war- level Loss Exchange Ratios (LERs) for combatant states involved in multilateral wars between 1816 and 1990.The battle-level data provide an alternative to the widely used, but problematic, HERO/CDB-90 data set on battle outcomes. To demonstrate the utility of the new data, we weigh in on the debate over democratic military effectiveness arguments by replicating models by Reiter and Stam (2002, 2009) and Downes (2009), finding that, when effectiveness is measured using LERs, democracies do not have an edge over their non-democratic counterparts.

Notes

1 See Biddle (Citation2004:152–153) for a description of the problems with the original CDB-90 data.

2 The Peace Research Institute at Oslo (PRIO) has also collected data on casualties. However, their temporal frame only covers 1946–2008 and they do not disaggregate casualties by the parties in the dispute.

3 See the LERD codebook for a detailed description of how the wars were disaggregated.

4 Of the 2,503 Battle Records, 1,024 come from Clodfelter and 158 come from Dupuy and Dupuy. The vast majority of other sources are in university press books. A small number of books by other presses were used when insufficient university press sources were available to inform data collection on a conflict. See the full project documentation for details.

5 “Killed in Action” or “Casualties,” which includes both killed in action and wounded in action.

6 The war-level aggregated data we use in this article is available from the authors or from the journal’s replication data archive. The full LERD project data on the battle level and source level, as well as the full coding document for each observation and the various aggregation procedures, are available from the LERD Project (Long, Cochran, and Wagstaff Citation2016).

7 For example, when computing Israel’s LER for the Six Day War, Israel’s LER for the battle of Rafah is weighted much more heavily than its LER for the battle of Ammunition Hill because the total casualties for Rafah were much higher (2,500 Israeli and Egyptian KIA) than Ammunition Hill (106 Israeli and Jordanian KIA).

8 Democracies are countries with a polity score above 17. Autocracies are countries with a polity score below 5. Anocracies have polity scores between 5 and 17.

9 They do not include the constitutive Politics term because they only have two categories of states and including it would lead to multicollinearity problems (Reiter and Stam Citation2002:40–41).

10 Details on these variables can be found in Reiter and Stam (Citation2002:38–44).

11 There are other important predictors of effectiveness that are not included in this model and there are alternative ways of measuring terrain and strategy. However, the goal of this article is to replicate Reiter and Stam’s analysis using new data. Given this goal, we focus on the variables they include to test their hypotheses.

12 In order to maintain consistency with Reiter and Stam’s models, the reported errors are not clustered on war. Because observations within wars are not independent, we ran robustness tests using clustered errors. This did not change our substantive results.

13 A Shapiro Wilks test shows that we cannot reject the null hypothesis that the residuals are normally distributed (p = .297), and visual tests of the residuals demonstrate that the data closely mirror a normal distribution. We use White’s test for generalized heteoskedasticty and find that the error terms are not homogenous (p = .004). This confirms our decision to use robust standard errors (and in later models to cluster those errors on war). We also ran robustness tests to evaluate how sensitive the results were to outliers, identifying three likely outliers (Germany and Yugoslavia in the Yugoslav theatre of WWII and Jordan during the Palestinian War) and three potential outliers (Greece during WWI, Israel during the War of Attrition, and Russia during the Sino-Soviet War). Exclusion of these cases individually or together did not affect the results.

14 He also rescales the polity variable to 1–21 rather than −10 to 10.

15 As with Reiter and Stam’s model, the Shapiro Wilks test shows we cannot reject the null hypothesis (p = .166). Visual tests confirm that the distribution closely approximates a normal distribution. This model assumes heteroskadistic errors clustered on war. An analysis of the homoskedasticity assumption using an identical model with non-clustered errors supports this decision (p = .094 for White’s test). The results from this model are insensitive to the exclusion of outliers.

16 See Reiter and Stam (Citation2002:41).

17 The Shapiro Wilks test shows that we cannot reject the null hypothesis of normally distributed errors (p = .52) though there is evidence of heteroskedasticity (p value for White’s test is .008). That is to be expected, given the interdependence between observations from the same war. The use of robust clustered errors alleviates this problem. As before, these results are insensitive to the exclusion of outliers.

18 We disaggregate the Western Front into two wars between the Western Powers and Germany and the Western Powers and Turkey. The allied participants differed, with Russia fighting with the allies in the latter conflict but not the former. The conflicts took place in two entirely separate theatres and separate peaces were signed at different times: the Treaty of Versailles between Germany and the Western Powers in June of 1918 and the Treaty of Sèvres between Turkey and the Western Powers in August of 1920.

19 This disaggregation generally follows Reiter and Stam’s (Citation2002) example. We depart from their definitions by combining the Norway and Denmark conflicts and the France, Belgium, and Holland conflicts. The outcomes of these conflicts were inextricably linked. In fact, Germany’s operational planning considered the relevant conflicts all part of one battle: Operation Weserübung for Norway and Denmark and Fall Gelb for the invasion of France and the low countries.

20 For example, in calculating the figures for the Battle of Doiran in WWI, Sondhaus (Citation2011) figures only provide casualty figures for Britain and Greece, even though France was also involved in the battle on the allies’ side. Using the LER from his account would artificially inflate Britain and Greece’s LER since the Bulgarian’s killed by the French would be attributed to them. Similarly Bulgaria’s LER would be underestimated since it would not be given credit for the French forces they killed. In the case, Clodfelter (Citation2008) and Hall (Citation2010) provide casualty figures for all of the battle participants and these figures are used to calculate the LER.

21 For example, for the Battle of Cambrai in WWI, we have a record for the overall battle and a record for an engagement that happened during the first day of the battle. The larger battle is coded as MULTIPLE, the smaller engagement as SUBSUMED, and the engagement is excluded.

22 For example, we have an observation for Operation Junction City in the Vietnam War, which was fought between the US and an NVA/VC team. This Operation includes six engagements including the initial attack, Ap Gu, Prek Klok I, Prek Klok II, and Suoi Tre. Four of these engagements provide LERs for the US vs. VC and two provide an LER for the US vs. NVA. When we aggregated the data, we included the engagements because they provide the most detailed information on how the NVA and VC performed when fighting the US. The overall Operation Junction City observation is excluded to avoid double-counting.

23 See footnote 18.

24 See footnote 19.

25 It should be noted that this LER is not created by taking the average LER across sources because averaging ratios can be problematic. Rather, this LER is created by taking the average casualties for side a (across sources) and dividing it by the average casualties for side b (across sources). The casualties are averaged first and then the LER is calculated from those averaged numbers.

26 See footnote 4.

27 See footnote 5.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.