1,719
Views
2
CrossRef citations to date
0
Altmetric
Articles

Rewards and penalties in an evolutionary game theoretic model of international environmental agreements

ORCID Icon, ORCID Icon, &
Pages 602-621 | Received 11 May 2020, Accepted 13 May 2021, Published online: 22 Jun 2021

Abstract

The UNFCC on climate change specifies that all nations must follow the rule of ‘common’ with differentiation regarding their responsibilities for the protection of the global environmental system. Recently, the formulation and stability of the IEA have been increased in the literature by applying the concept of game theory to make the climate agreements successful at the national and the international level. This study provides a novel evolutionary game theoretic model of self-enforcing IEA to overcome the free rider problem. The fundamental difference between our paper and existing literature is that we examine enforcement within a model as IEA has a governing authority while the typical model of enforcement involves a government enforcing a rule that it has imposed. For this purpose, we assign countries into different grades according to their pollution levels, consider a combination of rewards and penalties, use replicator dynamics to derive the conditions for the population steady state, and examine how the proposed regulatory mechanism fares in this steady state. This framework enables us to avoid the free rider and renegotiation problems as well as the rationality assumption. We establish the condition for evolutionary stability. The global environmental problem is managed effectively as a reward-punishment scheme and the monitoring frequency of IEA fulfills this condition. Our results provide an allocation principal with stable conditions under which countries get more benefits by monitoring the IEA and stability of the grand coalition holds.

JEL CODES:

1. Introduction

The global increase in greenhouse gases has put pressure on countries to design and re-design policies to mitigate climate change effectively. However, what is best individually may not be optimal at global level. One reason is the existence of the free rider problem regarding the abatement efforts. This problem is analogous to the provision of public goods. There have been attempts to achieve an effective aggregate emission reduction target via international cooperation. The international agenda for mitigation and sustainable development becomes more specific with Paris Accord and 2030 agenda for sustainable development which reflects the recognition that can mutually achieve the goals of sustainable development and climate mitigation (Ivanova et al. Citation2020). Climate justice addresses the disproportionate effect of climate change on population and community sovereignty (Tramel Citation2016) and researchers have agreed that climate change and global warming’ main source is energy consumption (Ahmad et al. Citation2017, Citation2019; Ahmad & Du, Citation2017). Two dimensions of climate justice have been seen: ‘distributional (i.e., who is affected by climate change and who benefits from, and pays for, adaptation and mitigation policy)’ and ‘procedural (i.e., whose voice is heard in decisions)’. Hence, access and allocation problem in climate change mitigation are very complex. International Environmental Agreements (IEA) were recently modified and enacted globally for more effective collaboration among countries (Günther & Hellmann Citation2017).Footnote1 However, the free rider problem still exists. In this study, we develop a dynamic evolutionary game theoretic model of pollution abatement with a global component. This model is unlike those suggested in the literature that relies on punishment strategies, where monitoring is incomplete and costly. Therefore, this paper contributes to a ‘reward and penalty scenario’ as a possible policy instrument in which countries decide on their emission levels and IEA decides on whether to monitor countries emissions. The model can be used for management of common environmental resources like the global climate. We use the concept of repeated games for stability analysis of the self-enforcing IEA. In our model, IEA is a strategic profile that aims at controlling pollution levels in the dynamics of a repeated game to maximize the countries’ joint payoff. We focus on the simple strategy where punishment is executed only if the country is polluting more than the agreed level. Apart from that, if the countries reduce their pollution levels conforming to the environmental standard, then they get the rewards settled by the IEA. Stability in IEA means that no country has an incentive to deviate from cooperation or to re-negotiate. The IEA is stable when the equilibrium conditions are satisfied.

Most of the international agreements are monitored by cost compliance.Footnote2 For example, the Montreal Protocol has a Committee that is responsible to gather information on the compliance of the countries which are suspected to have violated (or be violating) the Protocol (Benedick Citation1998). Similarly, according to the Kyoto protocol, a Compliance Committee has the responsibility to monitor the compliance behavior of the signatory countries and then report their behavior to the Enforcement Branch, which has the power to impose sanctions (Depledge Citation2000). However, CITES tries to make a similar provision, which can monitor the compliance of the signatories through a Standing Committee with its own managerial body that has the responsibility to enforce the specifications of the agreement.Footnote3 The enforcement provisions included in ‘CITES, Montreal and Kyoto Protocol’ allow their respective managerial bodies to monitor the parties’ behavior in the case of violation ofany of the treaties. Nevertheless, the design and structure of enforcement institutions, which, while being different across IEA, they do have a common characteristic: that the cost of monitoring activities must be borne by the parties. In this sense, our work provides a new extension of reward that helps the institutions to observe and finance the monitoring activities in an effective manner.

In literature (Barrett Citation1994; Carraro & Siniscalco Citation1993), Nash equilibrium was found by one shot-game. It is observed that there is Payoff advantage to IEA participation when participation is low and a Payoff advantage to free-riding when participation is high. IEAs membership are formed by internal and external stability from cartel literature (D’Aspremont et al. Citation1983). However, Barrett (Citation1994) proves that with significant gains from cooperation, there is only one benefit to join an IEA when participation is very low. McGinty (Citation2006) relaxes the assumptions of symmetry in Barrett (Citation1994) and find greater participation in the form of tradable pollution permits when transfers are implemented. However, Barrett (Citation1999, Citation2001) considers a linear IEA model through a stage game. In Barrett (Citation1999) identical countries has either constant or increasing return to abatement. Barrett (Citation2001) introduces benefit asymmetry by considering constant return to abatement through a prisoner dilemma with a dominant strategy of pollute into a stage game. In 1st stage countries decide whether to participate in IEA or not. In 2nd stage, signatory countries collectively choose to pollute or abate and non-signatory countries individually choose to pollute or abate. However, it is assumed that signatory countries choose to get maximum payoff and comply for this decision. This result is different from Barrett (Citation1994) in which one signatory who decide to pollute cannot result others countries to pollute. Then question arises that what type of IEA that is obtained in Barrett (Citation1999, Citation2001) through simple linear framework when game is that of partial participation, return to abatement vary, externalities differ both within and across population and time assumption of stage game are relaxed. Barrett (Citation2001) tries to highlight that how transfers among signatories countries can improve IEAs. An equilibrium is obtained for full participation through lower benefit countries and greater participation through high benefit countries in the case where ‘cooperation is for sale’. This paper allows a linear evolutionary game and allows stable equilibrium and model the special IEA and allows the externalities that are differ within and across populations.

A stable environmental agreement is crucial for the control of any kind of pollution at the global level. Although there is a large body of literature on emission mitigation strategies, there seems to be more to explore the development of a global environmental governing mechanism. Barrett (1994) seems to be the first who applied it to the analyses of IEA and examined their relations. However, the terminology ‘self-enforcing’ seems to be misleading because it does not highlight the enforcement of compliance on the members of an agreement. Most researchers who apply this concept just argue that parties of a ‘self-enforcing’ IEA comply totally with the terms of the agreements (e.g., Barrett, Citation1994, Citation2003; Carraro & Siniscalco, Citation1993, Citation1998; Hoel, Citation1992). Such a dynamic mechanism can be an effective tool that motivates countries to reduce their pollution levels. However, there is the issue of how to make the countries agree to adopt a governing mechanism of this sort. The reward-penalty scenario in the model incorporates the agreement on the governing mechanism in which IEA plays as a leading role. Environmental regulations in our model refer to the IEA as a governing authority to impose penalties on countries whose pollution exceeds the agreed levels. This kind of regulation relies on a simple rule and may lead to satisfactory results. Dividing the countries into two groups according to pollution standards can make the countries strive for the joint environmental objective and not for individual gains. However, the ‘only penalty but no reward’ scenario will make the countries see the environmental regulations as a burden. To overcome the evasion of supervision, secret filming and cover-ups happen now and then. Our model considers two aspects. First, we divide the countries into different graded categories according to their pollution levels and develop an evolutionary game model for this population. Second, we introduce the reward and punishment scheme such that reporting their pollution level truth fully in the first place makes the countries better off in equilibrium. This, we believe, is an important contribution of our model to the literature. The evolutionary perspective is adopted not because it seems suitable to capture the idea of survival in the face of environmental problems such as global warming, but because it enables us to avoid the rationality assumption in traditional game theory. According to our knowledge, incorporating both aspects is new to the literature.

This work adopts a different method to study the rewards and penalties by IEA on the basis of monitored activity that is a part of the established international agreements. However, our work maintains the self-enforcing agreements to find out the equilibrium of parties to an IEA. Therefore, to reduce the incentives of a party to attempt to violate the terms and conditions of an IEA, they finance an authorized body which has the power of monitoring extent to which the behavior of any party involved complies to what has been agreed on. The parties to an IEA that are monitored and deemed in violation of the agreement incur a costly sanction. There is also a large literature on the economics of enforcing environmental policies. Some contributions on the topic of enforcing emission standards, tradable emission permits, and emission taxes include Arguedas (Citation2008) and Stranlund (Citation2007). The fundamental difference between our work and the law enforcement literature is that we examine enforcement within a model as IEA has ruling authority, while the typical model of enforcement involves a government enforcing a rule that it has imposed.

The results of our model have some key policy implications. The IEA must impose the rules that will enforce the countries to lower their pollution to the agreed level. Therefore, IEA rules must specify the consequences of penalty as a punishment in cases where the countries deviate from their emission reduction targets; otherwise, the countries will get a reward. This reward-penalty scenario indeed appears to be the best path to control pollution at global level. Our model can also serve as a benchmark that could be helpful in the face of different global environmental problems. Furthermore, our model can be used towards a better understanding of the free rider problem.

The rest of the paper is organized as follows: Sec. 2 discusses where our study stands in the literature. In Sec. 3, we introduce and develop the evolutionary game model that focuses on the stability analysis of the IEA and introduces the equilibrium point. Section 4 explains the results of this paper and allocation criteria are also discussed. Finally, Sec. 5 discusses policy implications and possible extensions to this study.

2. Literature review

Game theoretic models are widely used in the literature to analyze International Environmental Agreements (IEA) on air pollution. Starting with Barrett (Citation1994), some scholars studied the free-riding problem by applying the concept of one-shot or repeated games. Günther and Hellmann(Citation2017) and Benchekroun and Long (Citation2012) provide excellent reviews of recent game theoretic models in environmental economics. However, the formal mechanisms for climate change cooperation are discussed in Hovi et al.(Citation2015). In their review of climate change cooperation models, Hovi et al.(Citation2015) paint a bleak picture of global cooperation, but also imply that large and stable coalitions may be possible. Koo and Hong (Citation2017) have commented that various efforts have been made for greenhouse gas reduction where carbon reduction and energy performance certificate can be achieved through voluntary national energy saving campaign. They have proposed four incentives and four penalty programs by considering three criterion such as building, community and national level. On the other hand, researchers like Pavone (Citation2018) have documented that CO2 emissions are in rise steadily and efforts should be made to counter the problem. Our study aims to specify the conditions under which this may occur.

Considering global pollution as a repeated game, plenty of studies treated IEA as the strategies of coalitions that will maximize utility for participants that are committed to reducing their pollution levels. In most of these studies, all member countries are involved in the punishment of the deviator. However, the non-cooperative equilibrium cannot be improved on due to only a few countries cooperating to implement such punishment (Barrett Citation1994, Citation1997). To reduce the incentive of renegotiation, some scholars introduced punishment strategies. Asheim and Holtsmark(Citation2009) and Froyn and Hovi (Citation2008), for example, introduce the ‘penance’ strategies that are modified with ‘penance-k’ strategies in Farrell and Maskin (Citation1989), where a subgroup of k-players punish a deviator for a finite number of periods until the countries revert to cooperation. Our model differentiates itself from the existing literature by incorporating both reward and penalty strategies. This is a novel structure to model the IEA.

An enforcement model for global pollution control was proposed in Heitzig et al. (Citation2011). The linear compensation in their model dynamics does not provide efficient punishments, but a redistribution scheme of liabilities, based on the level of compliance with the previously committed levels, is required. This kind of setup allocates the targeted distribution and is an efficient solution for full cooperation with strong stability properties. They did not apply the concept of renegotiation to keep pollution levels on steady basis and showed that any allocation of optimal total payoff can be sustained as a subgame perfect equilibrium. They indicate a need for incorporating bounded rationality in future work. The evolutionary game theory approach enables us to avoid the rationality and the common knowledge of rationality assumptions.

Some of the environmental models treat pollution as a flow variable. However, it may be acceptable for short-lived climate gases and it requires the imposition of assumed simplification of greenhouse gases that remain in the climate for a long time. Rubio and Casino (Citation2005) and Rubio and Ulph (Citation2007) were the first studies that applied IEA for a pollution stock. The literature that treats pollution as stock is detailed in Calvo and Rubio (Citation2013). The stability conditions for environmental agreements are studied by Kratzsch, Sieg, and Stegemann (Citation2012) in which emission stock builds over time and payoff depends on pollution level. They showed that it is possible to enforce international cooperation via ‘penance-k’ strategies. Although we treat pollution as a flow variable in our model, we also discuss how our results fare when pollution is considered as stock.

The application of evolutionary game theory to environmental pollution problems has been picked up recently in the literature. For example, Zhang et al. (Citation2014) investigates the optimal controlled strategies by using two-stage programmed model of regional water pollution. The paper of Miao et al. (Citation2014) applied an ‘interval-fuzzy De Novo programming’ model to study the analysis of optimal allocation for water resources. Similarly, the optimal designs to manage the regional energy problems are studied by Suo et al. (Citation2013) and Zhang et al.(Citation2014). We do not resort to this approach because optimal pollution control strategies derived from the optimization methods are appropriate at the regional level. Instead, we combine the blame game of Ellingsen and Östling (Citation2011) with a rewards-penalties scenario at the global level to examine IEA.

We find a few evolutionary game theoretic models that deal with the global pollution problem in a standardized manner for IEA. Breton et al. (Citation2010) develop a dynamic evolutionary game to examine which countries join the IEAs in the equilibrium condition. In this model, only signatory countries impose the punishment to non-signatory countries on some cost and introduce a mechanism to report how the countries reach a stable IEA. This model characterizes stable partial cooperation within the IEA over a period and captures the situation in which all countries participate in a stable agreement. McGinty (Citation2010) introduces an evolutionary equilibrium in which no signatory country prefers to remain outside the IEA and the evolutionary equilibrium is robust to trembles. He finds a unique interior evolutionary equilibrium in two populations with a decreasing return in abatements and small asymmetries in externalities within the populations. He concludes that the IEA’s predictions on the polluter pay and the ability to pay principles result in a Pareto inferior outcome. In the light of literature on evolutionary game theoretic models for international environmental agreements (IEA), our model introduces an extension in the reward and penalty scenario that overcomes the free rider problem and suggests a regulatory mechanism that can mitigate global pollution effectively. Bollen et al. (Citation2009) introduce a cost-benefit analysis where some pollution control policies can generate extra benefits for climate change mitigation. Following their recommendation, we formulate a ‘reward and punishment’ scenario that allows controlling pollution at the global level. Our evolutionary game theoretic model presents a policy suggestion that makes participants better off.

3. Modeling design

3.1. Penalty-reward mechanism

In this paper, we are interested to investigate and understand that how the addition of Reward-penalty mechanism in IEAs can affect countries decision to participate. In this sense, cost-benefit calculation derives countries participation decision (Bernauer et al. Citation2010, Citation2013; Roberts et al. Citation2004; Spilker & Koubi Citation2016). For joining an IEA, countries bind themselves through international law to obligate an agreement. Although, such obligation is costly (Downs et al. Citation1996). This kind of cost includes not only direct cost to implement an agreed policyFootnote4, but also includes indirect cost for the lose of sovereignty and autonomy. Joining an IEA also results in transaction cost due to cooperation and coordination with other treaties when a conflict arise (For example, Bernauer et al. Citation2013; Downs et al. Citation1996; Spilker & Koubi Citation2016). Literature also provides evidence that such costs decrease countries willing to join an IEA (Hathaway Citation2003; Roberts et al. Citation2004). While, some studies show that treaties feature can reduce such costs (for example, availability of assistance) that will increase countries participations level (Spilker & Koubi Citation2016). To increase countries participation level, Asheim and Holtsmark (Citation2009) and Froyn and Hovi (Citation2008) introduce penalties strategies. For example, ‘penance’ strategies that are modified with ‘penance-k’ strategies in Farrell and Maskin (Citation1989) is a subgroup of k-players which will punish a deviator for a finite number of periods until the countries revert for cooperation. We add to this literature by examining whether the inclusion of reward-penalty mechanism adds or reduces to treaty participation to make an IEA more successful

In order to encourage countries to control pollution, we assume that IEA not only punishes countries based on their pollution level but also rewards those that meet the pollution emission standards under MPC. Let us suppose that the IEA has to encourage the countries not only by means of penalty but also by providing rewards to those countries that reached the targeted pollution reduction level settled by IEA. Suppose that IEA divides the countries into different ranks by their pollution levels. The IEA dividing scheme will be as follows: if the country’s pollution level is lower than the target level, then this is denoted by E1. Acountry’s pollution level increases by a fixed amount to levels denoted as E2,,Ei,,EK, where EiN+ and then the pollution emission strategy set is written as L=(E1,E2,,EK). Then, the IEA imposes a penalty ofEiE11 on those countries that take the strategy Ei, where ε is a punishment amount and (EiE11) determines the extent of punishment.

In our game, there are N countries that are free to make the decision to choose Ei. The strategy set of the jth country isδj, where δjL. L belongs to the higher effort level of the countries for pollution reduction. It means that the countries can freely choose their pollution emission strategies from . Hence, the strategy set of N countries is shown asS=(δ1,δ2,,δN). Suppose that E_=min(δ1,δ2,,δN) and E¯=max(δ1,δ2,,δN). The reward-penalty mechanism can be described as follows: giving a reward γ for countries that meet environmental standards and imposing a penalty γ on the most serious polluters, then, the rewards and penalties for the countries that adopt the strategy Ei are indicated by the function,γ.Ri. Where, Ri is a reward-penalty indicator as in EquationEq. (1) below: (1) Ri={1 δi=E_1 δi=E¯EiE¯ else(1)

Figure 1. Countries’ strategies for reward and penalty. Source: Author’s own calculation based on evolutionary game model assumptions.

Figure 1. Countries’ strategies for reward and penalty. Source: Author’s own calculation based on evolutionary game model assumptions.

3.2. Evolutionary game modeling on IEA’s monitoring for pollution control

In order to tackle the global pollution problem, suppose there are N countries in the evolutionary game model. Our evolutionary game theoretic model is based on the reward-penalty scenario. The countries freely make their decisions according to the principle of maximizing their benefits. Set the proportion of countries that take the strategy of Ei in all N countries as Xi in period t; then, the proportion vector that depicts the countries’ pollution emission situation can be written as X={X1,X2,,XK}, wherei=1KXi=1. A country adopting the Ei strategy obtains a payoff of Γ(Ei). If the IEA applies the monitoring strategies, then the countries suffer from the graded penalty (EiE1).ε and gain the reward-penalty compensations γ.Ri. Then, the utility functionπA(Ei) of a country adopting the Ei strategy that will be monitored by the IEA is as follows: (2) πA(Ei)=Γ(Ei)(EiE11).ε+γ.Ri(2)

Similarly, the utility function πB(Ei) of a country adopting the Ei strategy that is not monitored by the IEA is as follows: (3) πB(Ei)=Γ(Ei)(3)

It is very simple to prove that all countries adopt the high efforts level in ‘unique pure strategy equilibrium’. The detailed proof of the pure strategy equilibrium can be found in Propositions 1 and 2 and dynamic equilibrium conditions of our game model can be found in the Appendix.

Proposition 1.

The ‘unique pure strategy equilibrium’ of an Evolutionary Game will be (K, K,, K).

Proof.

Let η<εC. For any strategy profile with E_<K, the payoff is given to a country for choosing Ei=E_+η is πi=Γ(E_+η)(E_+ηE11).ε+γ.Ri, which exceeds πi=Γ(Ei)(EiE11).ε+γ.Ri. For a strategy profile in which E_=K, then there is a unique best response of all countries which choose to play K. Therefore, the ‘unique pure strategy equilibrium’ will be (K,K,,K).

In discrete versions of the Evolutionary game, the conditions of a ‘unique equilibrium’ are a bit more restrictive. Otherwise, if the conditions are met, we can relax the solution concept. This paper considers particular finite strategies set (1, 1+k,.,Kk,K) with K>0.

Proposition 2:

Suppose ε>Ck. The only strategic profile that will survive iterated elimination of a strictly dominated strategy will be (K,K,,K).

Proof.

The proof tries to show that the low effort level is strictly dominated by the second low effort level. Once the low effort level is eliminated, the second low effort level is strictly dominated by the third, and so on. To prove this, suppose that E<K is a low effort level which will not be eliminated yet. In this sense, this shows that playing Ei=E is strictly dominated.

By playing Ei=E=E+k,playing E yields the payoff πi=Γ(E)(EE11).ε+γ.Ri irrespective of what the other players do (since effort levels below E have been eliminated). Playing E gives payoff πi=Γ(E)(E+kE11).ε+γ.Ri if E_=E and πi=Γ(E+k)(E+kE11).ε+γ.Ri if E_=E. Clearly, playing E strictly dominates playing E as long as εε>Ck. This process of elimination continues until all effort levels, except E=K, are eliminated.

The IEA sets the monitoring cost and probability as C and P at time t, respectively. The pollution control cost is denoted as φ(Ei) and the negative impacts of not monitoring are σ(Ei). Then, the utility function uA, when IEA monitors, is defined as in (4): (4) uA=C+i=1KXi(EiE11).εi=1KXi.φ(Ei),(4)

Similarly, the utility function uB, when IEA does not monitor, is as follows: (5) uB=i=1KXi.σ(Ei),(5)

The stability analyses are presented in . shows the evolutionary stability of the equilibrium point that is determined from strategic benefits. If a strategy receives higher benefits than other strategies, then the countries will chooseEi strategy by continuous imitation as an equilibrium strategy in the long-run steady state of the repeated game. For other strategies Ej(ij) or mixed strategies, the equilibrium will be unstable. If the benefits of all strategies are equal, then the countries’ evolutionary strategies are more complex, and bifurcation might occur. Therefore, the IEA should adopt the affordable levels of rewards and penalties to stimulate strategies that yield higher benefits for countries, like, π¯(E1)>max{π¯(E2),,π¯(EK)}.

Table 1. Stability analyses for equilibrium Xi=(0,0,,1,.0)T.

4. Simple example, allocation criteria for emission reduction

In case of multiple countries that are committed to reducing their pollution levels, assume that IEA is going to divide the countries into two grades and this grading system is according to the pollution levels as in L={E1,E2}={1,2}, Suppose φ(Ei)=Ei1,σ(Ei)=1Ei,Γ(Ei)=Ei1,c=1, E_=min(E1,E2)=E1 and E¯=max(E1,E2)=E2 and we examine the interaction between the IEA and countries’ pollution strategies.

For IEA, the expected benefits of adopting the monitoring and not-monitoring strategies are defined as uA=1X2+X2.ε, uB=X2 from EquationEqs. (4) and Equation(5) respectively. If uA>uB, it means thatX2>1(ε2), then the expected benefits of choosing the monitoring strategies are higher than that of not monitoring. our model argues that if a country limits his emission level at i=1KXi=1, then that country can claim his reward in the case when IEA monitorand and its expected benefits is as uA=1X2+X2.ε. In the case, when IEA adopts not-monitoring strategy then its expected benefits is as uB=X2. Therefore, the IEA will decide to adopt the monitoring strategies after a long-term repeated game. However, at P=1, the IEA will become consistent and will be in the equilibrium state at which it could enforce the laws independently. If uA<uB, it means that X2<1(ε2); in this case, the expected benefits of not-monitoring are higher than those of monitoring. Therefore, the IEA will decide to adopt the not-monitoring strategy in the long-run steady state of the population. Moreover, at P=0, the IEA will not be in an equilibrium path, which will lead to failure of supervision and to environmental degradation. If uA=uB, it means that X2=1(ε2) and any P[0,1] is an equilibrium strategy, but not an evolutionary stable strategy. So, we conclude that, if the proportion of countries with pollution levels is higher than the critical values 1(ε2), then the IEA will tend to monitor, otherwise not.

For the countries, the expected benefits of adopting the strategies E1 or E2 are defined asπ¯(E1)=P.γ and π¯(E2)=1P.εP.γ, respectively. Two equilibrium conditions, X1=(1,0)T and X2=(0,1)T are obtained from EquationEq. (A4) from the Appendix. We can conclude that the stable strategies are as follows; if π¯(E1)>π¯(E2) and π¯(E1)>0, i.e., 2P.γ+P.ε>1, the compensation incentives are higher than the punishments. Therefore, in this case, the countries will control their pollution level in the long run. The E1 will be the equilibrium strategy in this case and the IEA will effectively impose regulations to prevent environmental degradation. If π¯(E2)>π¯(E1) and π¯(E2)>0, i.e., P.γ+P.ε<1, then the compensation incentives are lower than the punishments imposed by IEA. Therefore, the countries’ pollution levels will rise in the long run. If the aforementioned conditions are not fulfilled, then there are no evolutionary stable strategies. When this situation occurs, the IEA monitoring will not be significant, and the countries will randomly choose their strategies. So, we conclude that IEA must adjust the reward and penalty system and increase the monitoring frequencies (2P.γ+P.ε>1) in order to encourage the countries to control their pollution levels effectively.

4.1. Allocation principle

For pollution control, the second commitment period of the Kyoto protocol is going to end in 2020 after the extension decided in the UNFCC meeting on climate change. During this period, a different suggestion was introduced for future institutional setup and most of them employ different perspectives. Some of the signatory states suggest a governance structure based on a global emission trading scheme (Bradford Citation2004). Some of the researchers put forward a framework of international technology development (Barrett & Stavins Citation2003; Benedick Citation1998; Rose et al. Citation1998). Some of them propose an institutional framework in terms of political feasibilities (Aldy et al., Citation2003, Citation2010; Bodansky, Citation2004; Ringius et al., Citation2002). The allocation of CO2 emission reduction among countries remained in discussion in all of the proposals during the Kyoto conference and thereafter it is referred to as ‘burden-sharing’. Some other proposals introduce ideas that go beyond the country’s institutional setup that provides partnerships and other non-states mechanisms.

The most challenging task in the research on climate change is allocation. The allocation criteria for emission reduction can be categorized in three broad categories: responsibility, capability, and efficiency (den Elzen et al., Citation2007). Consequently, these criteria create a new allocation method under the same principles. Every nation is responsible for climate change according to this approach. Some proposals allocate emission limitations on a country level such as a clean development mechanism (La Rovere et al., Citation2002) that is based on the polluter pay principle. However, according to the same criteria, earlier industrialized countries like the UK bear more burden in comparison to developed countries like the USA and Japan. Methodologically, some questions arise regarding this criterion, such a show to select the carbon cycle in a model, the starting year of calculation, and how to deal with reasonable data over the past 100 years. Therefore, there was a need to develop a strategic model in which each emitter who is not curious for pollution control should monitor through a strategic plan to sustain global cooperation.

Another responsibility criterion is based on per capita emissions. Although the figures of per capita emission vary between developed and underdeveloped countries and under these criteria, the final allocation should be equal. The formulation of this approach can be found in the literature as ‘contraction and convergence’ (Aslam, Citation2002; den Elzen & Meinshausen, Citation2006). Under this approach, countries are committed to converging their emissions per capita by a certain year, like 2050, 2100. Some studies focus on the allocation criteria under responsibility (see, for example, Ott et al. Citation2004).

The ability to pay is also an allocation scheme that focuses on economic perspectives for pollution control. Under this approach, the parties can only reduce their emission when they have exceeded a certain level of per capita welfare (Jacoby et al., Citation1999). Finally, the efficiency criteria emphasize the allocation of benefits and burden implied by climate mitigation. The allocation under this approach seems fair in terms of economic competitiveness among countries (Kanie et al. Citation2010). Under the allocation principle, a question arises: can a mechanism design reach the large IEA? Our finding provides a stable condition under which countries get more benefits when IEA implements monitoring practices and the stability of the grand coalition will hold. In this line, the Montreal Protocol and the Vienna convention seem real examples that were initially signed by 28 and 46 countries respectively and reached universal ratification in 2009. Our results show an important difference from the literature that has considered the concept of MPC (Rubio & Casino Citation2005). In our case, the number of countries is taken endogenously, while it was exogenous in the reference case. Carraro et al. (Citation2009) choose to consider MPC endogenously by adding the first stage in which countries set their minimal coalition size for a treaty to be implemented.

4.2. How IEA can overcome free riding?

Free rider problem can be overcome by trade measure and issue linkage. The trade measures can be considered as attractive incentive for free ride. High incentives can increase participation which can be mitigated by a trade measure that is more effective for larger participation. Tow kind of trade’s measures can be considered: trade control and trade sanction. ‘A trade control’ looks like an instrument that will be used for regulating a product that might be in the agreement. ‘A trade sanction’ works as a particular act that will force government actions and will take an action for non-compliance or non-conformity to an international standard. The trade control is adopted for international environmental treaties like Montreal protocol.Footnote5 Therefore, Brack, Grubb, and Windram (Citation2000) highlighted that such kind of controls for greenhouse gases could create the difficulties for applying and could create the severe limitation for trade as well as a higher welfare loss. However, they suggest that similar kind of controls can be very operative and can be considered for developing a climate rule.Footnote6 While in the case of ‘trade sanctions’, any ecological treaty can’t use it as an instrument to implement like WTO.Footnote7 Victor (Citation2004) highlighted that implementation of ‘trade sanctions’ in the climate change system can successfully connected with WTO.Footnote8 However, the studies on the role of economic sanction in global organization can’t promise high efficiency. Even though, the implementation is not being enforced positively, so the climate regimes must take the alternative enforcement techniqueFootnote9 (Charnovitz Citation2003). In spite of various difficult problems, as well as compatibility with WTO rule, implementations of the trade measure cannot be positively applied. Such kind of implementations measures might be consider as the last option that can guaranteed a real cooperative agreement.

The free rider incentive could be dealt with another strategic policy that is the ‘issue linkage’. Cooperation that is exaggerated by free-riding—i.e., the cooperation for global climate change—could be associated with the club or quasi-club goods. The perception might be that an incentive for free-riding with non-excluded benefit of public goods could be equalized with an incentive that will come from excluded benefit from the provision of club goods. Barrett (Citation1994, Citation1997) offers a link that will protect the environment to negotiate on trade liberalization. In this way, potential free-riders are frightened with the threats of trade sanctions. In Carraro and Moriconi (Citation1997), the environmental cooperation is linking to ‘cooperation in Research and Development’. If any country doesn’t want to join an agreement to control the environment, then it will lose the benefits from technological cooperation. Mohr (Citation1995) and Mohr and Thomas (Citation1998) propose linking climate negotiations to international debt swaps.

5. Conclusion and policy

In this paper, we provide a novel evolutionary game theoretic model of self-enforcing IEA. For this purpose, we assign countries into different grade categories according to their pollution levels, consider a combination of rewards and penalties, use replicator dynamics to derive the conditions for the population steady state, and examine how the proposed regulatory mechanism fares in this steady state. This framework enables us to avoid the free rider and the renegotiation problems of IEAs, the rationality and common knowledge of rationality assumptions of the traditional game theory. We establish the condition for a steady state. The global environmental problem is managed effectively as the reward-punishment scheme and monitoring frequency of the IEA fulfills this condition under MPC.

Our main findings indicate that in the steady state of the game, countries conform to their agreed pollution levels, when the IEA’s rewards and penalties, and the monitoring frequency meet the equilibrium condition. This means that the IEA can fulfill its duties and prevent environmental degradation efficiently in the case of equilibrium under the evolutionary game. Otherwise, the benefits for deviation will be higher than the reward for accommodation leading to increased pollution levels. We conclude that, in order to keep global pollution levels under control, the IEA needs to continuously adjust the level of rewards and punishments as well as the monitoring frequency.

We impose a restriction on IEA such that countries only have one option to control their pollution levels. Therefore, in a repeated game, the penalty will be imposed only if the country pollution level increase. In reality, countries may not be flexible to adjust their pollution levels. However, they have another option to prevent deviation. For example, one option is trade sanctions imposed on the signatories of the IEA. To sustain global cooperation, Barrett (Citation1997) shows that imposing trade sanctions is the only possible way that can provide necessary incentives to countries to join IEA. Allowing for trade sanctions in an evolutionary game theoretic model could be a valuable extension of our model.

Due to the generalizability of our model, our mechanism design could serve as a benchmarking mechanism that could be furnished and refined in the future. We observe that the pollution targeting policies put forward in Bollen et al. (Citation2009), considering the effects of global and local pollution could help improve global cooperation. Considering the local spillover of pollution, our reward-penalty mechanism could be designed accordingly helping countries to avoid free riders without making the agreement vulnerable to renegotiation. In this way, some countries (US, UK, EU) that initially agreed to reduce the pollution on a conditional basis of others joining them, might achieve the global cooperation by taking the pollution structure into account. Last but not least, our mechanism design is not limited to the application of pollution reduction. It can be easily applied to other problems pertaining to the provision of global public goods.

A surprising policy implications emerge from model. Both, ‘polluter pay principle’ and ‘ability-to-pay principle’ dictates that rich countries are required to abate. It is a central force behind the Kyoto Protocol in which reducing global inequality was clear goal. If Annex-1 countries were allowed to purchase tradeable pollution permits from Non-Annex-1 countries to meet their abatement requirements. In this way, a transfer mechanism from rich to poor countries can be settled for pollution control. But in case of decreasing return to scale, all countries will be better off if a poor make a transfer to rich. Rich countries bring more to agreement, but receive higher incentive to remain outside. To overcome this free riding incentive is a reason that rich countries are receiving positive transfer. Effective IEA must be pare to-improving and increase global inequality. One possible reason in this inequality is issue-linkage. Therefore, a multi-issue agreement has a potential to increase payoffs and reduce inequality.

Abbreviations
CITES=

Convention on International Trade in Endangered Species

IEA=

International Environmental Agreements

MPC=

Minimum Participation Clause

ODS=

Ozone-Depleting Substances

UNFCC=

United Nations Framework Convention on Climate Change

Notes

1 Examples are the Oslo protocol in 1994 for reducing sulfur, the Montreal protocol in 1987 for the depletion of the ozone layer and the Kyoto protocol in 1997 for the reduction of greenhouse gas emissions.

2 This point is argued by Heister et al.(Citation1997) which provides the qualitatives analyses of why IEAs need enforcement and best strategy to enforce a compliance.

3 Convention on International Trade in Endangered Species, Washington, March 1973; Amended 1979.

4 For example, phasing out ODS under the Monotoreal Protocol. The cost to implement the ODS under Monotoreal Protocol was estimated between $5 Billion and $12 Billions for USA and $235 Billions for global (Barrett Citation2003).

5 In the Montreal Protocol, parties are required to ban trade with non-parties of ozone-depleting substances and products containing them.

6 More limited measures such as the application of duties or taxed against various categories of imports from non-parties could also be employed, according to (Brack et al. Citation2000).

7 The only two international organizations that impose trade sanctions against non-compliance are the UN Security Council and the WTO.

8 (Victor Citation2004) suggests a program of penalty tariffs and trade sanctions to counteract the economic advantage gained through non-compliance. Schram Stokke (Citation2004) has also argued that trade measures could be an effective instrument against non-compliance. He predicts that such sanctions would work best if they were carried out multilaterally against the country at fault.

9 One possibility would be to enhance transparency and public participation in the international supervisory system in the hope of putting internal political pressure on governments to comply. The climate regime could also consider the use of monetary assessments against non-complying governments, a technique employed in the European Union, and being tested in new free trade agreements, e.g., U.S.-Singapore.

10 Schlag (Citation1998) shows that the dynamics of a population with randomly matched bounded rational players changing their strategies via imitation of successful strategies observed can be approximated by replicator dynamics.

References

  • Ahmad, N., & Du, L. (2017). Effects of energy production and CO2 emissions on economic growth in Iran: ARDL approach. Energy, 123, 521–537. https://doi.org/10.1016/j.energy.2017.01.144
  • Ahmad, N., Du, L. S., Lu, J., Wang, J. L., Li, H., & Zaffar, M. (2017). Modelling the CO2 emissions and economic growth in Croatia: Is there any environmental Kuznets curve? Energy, 123, 164–172. https://doi.org/10.1016/j.energy.2016.12.106
  • Ahmad, N., Du, L., Tian, X.-L., & Wang, J. (2019). Chinese growth and dilemmas: Modelling energy consumption, CO2 emissions and growth in China. Quality & Quantity, 53(1), 315–338. https://doi.org/10.1007/s11135-018-0755-0
  • Aldy, J. E., Barrett, S., & Stavins, R. N. (2003). Thirteen plus one: A comparison of global climate policy architectures. Climate Policy, 3(4), 373–397. https://doi.org/10.1016/j.clipol.2003.09.004
  • Aldy, J. E., Krupnick, A. J., Newell, R. G., Parry, I. W. H., & Pizer, W. A. (2010). Designing climate mitigation policy. Journal of Economic Literature, 48(4), 903–934. https://doi.org/10.1257/jel.48.4.903
  • Arguedas, C. (2008). To comply or not to comply? Pollution standard setting under costly monitoring and sanctioning. Environmental and Resource Economics, 41(2), 155–168. https://doi.org/10.1007/s10640-007-9185-5
  • Asheim, G. B., & Holtsmark, B. (2009). Renegotiation-proof climate agreements with full participation: Conditions for Pareto-efficiency. Environmental and Resource Economics, 43(4), 519–533. https://doi.org/10.1007/s10640-008-9247-3
  • Aslam, M. A. (2002). Equal per capita entitlements: A key to global participation on climate change?. In K. Baumert (Ed.), Building on the Kyoto protocol options for protecting the climate. World Resources Institute.
  • Barrett, S. (1994). Self-enforcing international environmental agreements. Oxford Economic Papers, 46(Suppl_1), 878–894. https://doi.org/10.1093/oep/46.Supplement_1.878
  • Barrett, S. (1997). Heterogeneous international environmental agreements.
  • Barrett, S. (1999). International cooperation and the global environment. Global public goods.
  • Barrett, S. (2001). International cooperation for sale. European Economic Review, 45(10), 1835–1850. https://doi.org/10.1016/S0014-2921(01)00082-4
  • Barrett, S. (2003). Environment and statecraft: The strategy of environmental treaty-making. Oxford University Press.
  • Barrett, S., & Stavins, R. (2003). Increasing participation and compliance in international climate change agreements. International Environmental Agreements, 3(4), 349–376. https://doi.org/10.1023/B:INEA.0000005767.67689.28
  • Benchekroun, H., & Long, N. (2012). Collaborative environmental management: A review of the literature. International Game Theory Review, 14(04), 1240002. https://doi.org/10.1142/S0219198912400026
  • Benedick, R. E. (1998). Ozone diplomacy: New directions in safeguarding the planet. Harvard University Press.
  • Bernauer, T., Kalbhenn, A., Koubi, V., & Spilker, G. (2010). A comparison of international and domestic sources of global governance dynamics. British Journal of Political Science, 40(3), 509–538. https://doi.org/10.1017/S0007123410000098
  • Bernauer, T., Kalbhenn, A., Koubi, V., & Spilker, G. (2013). Is there a “depth versus participation” dilemma in international cooperation? The Review of International Organizations, 8(4), 477–497. https://doi.org/10.1007/s11558-013-9165-1
  • Bodansky, D. (2004). International climate efforts beyond 2012: A survey of approaches.
  • Bollen, J., van der Zwaan, B., Brink, C., & Eerens, H. (2009). Local air pollution and global climate change: A combined cost-benefit analysis. Resource and Energy Economics, 31(3), 161–181. https://doi.org/10.1016/j.reseneeco.2009.03.001
  • Brack, D., Grubb, M., & Windram, C. (2000). International trade and climate change policies. Earthscan.
  • Bradford, D. F. (2004). Improving on Kyoto: Greenhouse gas control as the purchase of a global public good. Center for Economic Policy Studies, Princeton University.
  • Breton, M., Sbragia, L., & Zaccour, G. (2010). A dynamic model for international environmental agreements. Environmental and Resource Economics, 45(1), 25–48. https://doi.org/10.1007/s10640-009-9304-6
  • Calvo, E., & Rubio, S. J. (2013). Dynamic models of international environmental agreements: A differential game approach. International Review of Environmental and Resource Economics, 6(4), 289–339. https://doi.org/10.1561/101.00000053
  • Carraro, C., & Moriconi, F. (1997). International games on climate change control. Fondazione Eni Enrico Mattei Working Paper (22.98).
  • Carraro, C., & Siniscalco, D. (1993). Strategies for the international protection of the environment. Journal of Public Economics, 52(3), 309–328. https://doi.org/10.1016/0047-2727(93)90037-T
  • Carraro, C., & Siniscalco, D. (1998). International environmental agreements: Incentives and political economy. European Economic Review, 42(3–5), 561–572. https://doi.org/10.1016/S0014-2921(97)00118-9
  • Carraro, C., Marchiori, C., & Oreffice, S. (2009). Endogenous minimum participation in international environmental treaties. Environmental and Resource Economics, 42(3), 411–425. https://doi.org/10.1007/s10640-009-9264-x
  • Charnovitz, S. (2003). Trade and climate: potential conflicts and synergies. In Aldy, J.E., Ashton, J., Baron, R., Bodansky, R., Charnovitz, S., Diringer, E., Heller T. C., Pershing, J., Shukla P.R., Tubiana, L., Tudela, F., Wang, X., & Beyond Kyoto (eds.), Advancing the international effort against climate change, Pew Centre of Global Climate Change, 141–170. Arlington, VA.
  • D’Aspremont, C., Jacquemin, A., Gabszewicz, J. J., & Weymark, J. A. (1983). On the stability of collusive price leadership. The Canadian Journal of Economics, 16(1), 17–25. https://doi.org/10.2307/134972
  • den Elzen, M. G. J., & Meinshausen, M. (2006). Multi-gas emission pathways for meeting the EU 2_C climate target. In H. J. Schellnhuber, W. Cramer, N. Nakicenovic, T. Wigley, & G. Yohe (Eds.), Avoiding dangerous climate change. Cambridge University Press.
  • den Elzen, M. G. J., Höhne, N., Brouns, B., Winkler, H., & Ott, H. E. (2007). Differentiation of countries’ future commitments in a post-2012 climate regime: An assessment of the “South–North Dialogue”. Environmental Science & Policy, 10(3), 185–203. https://doi.org/10.1016/j.envsci.2006.10.009
  • Depledge, J. (2000). Tracing the origins of the Kyoto protocol: An article-by-article textual history. Technical paper, UNFCCC. http://unfccc.int/resource/docs/tp/tp0200.pdf. Accessed 24 November 2017.
  • Downs, G. W., Rocke, D. M., & Barsoom, P. N. (1996). Is the good news about compliance good news about cooperation? International Organization, 50(3), 379–406. https://doi.org/10.1017/S0020818300033427
  • Ellingsen, T., & Östling, R. (2011). Strategic risk and coordination failure in blame games. Economics Letters, 110(2), 90–92. https://doi.org/10.1016/j.econlet.2010.10.018
  • Farrell, J., & Maskin, E. (1989). Renegotiation in repeated games. Games and Economic Behavior, 1(4), 327–360. https://doi.org/10.1016/0899-8256(89)90021-3
  • Froyn, C. B., & Hovi, J. (2008). A climate agreement with full participation. Economics Letters, 99(2), 317–319. https://doi.org/10.1016/j.econlet.2007.07.013
  • Günther, M., & Hellmann, T. (2017). International environmental agreements for local and global pollution. Journal of Environmental Economics and Management, 81, 38–58. https://doi.org/10.1016/j.jeem.2016.09.001
  • Hathaway, O. A. (2003). The cost of commitment. Stanford Law Review, 55(5), 1821–1862.
  • Heister, J., Mohr, E., Stähler, F., Stoll, T., & Wolfrum, R. (1997). Strategies to enforce compliance with an international CO2-treaty. International Environmental Affairs, 1, 22–53.
  • Heitzig, J., Lessmann, K., & Zou, Y. (2011). Self-enforcing strategies to deter free-riding in the climate change mitigation game and other repeated public good games. Proceedings of the National Academy of Sciences of the United States of America, 108(38), 15739–15744. https://doi.org/10.1073/pnas.1106265108
  • Hoel, M. (1992). International environment conventions: The case of uniform reductions of emissions. Environmental and Resource Economics, 2(2), 141–159.
  • Hovi, J., Ward, H., & Grundig, F. (2015). Hope or despair? Formal models of climate cooperation. Environmental and Resource Economics, 62(4), 665–688. https://doi.org/10.1007/s10640-014-9799-3
  • Ivanova, A., Zia, A., Ahmad, P., & Bastos-Lima, M. (2020). Climate mitigation policies and actions: Access and allocation issues. International Environmental Agreements: Politics, Law and Economics, 20(2), 287–301. https://doi.org/10.1007/s10784-020-09483-7
  • Jacoby, H. D., Schmalensee, R., & Wing, I. S. (1999). Toward a useful architecture for climate change negotiations. Report No 49.
  • Kanie, N., Nishimoto, H., Hijioka, Y., & Kameyama, Y. (2010). Allocation and architecture in climate governance beyond Kyoto: Lessons from interdisciplinary research on target setting. International Environmental Agreements: Politics, Law and Economics, 10(4), 299–315. https://doi.org/10.1007/s10784-010-9143-5
  • Koo, C., & Hong, T. (2017). Development of a dynamic incentive and penalty program for improving the energy performance of existing buildings. Technological and Economic Development of Economy, 24(2), 295–317. https://doi.org/10.3846/20294913.2016.1212741
  • Kratzsch, U., Sieg, G., & Stegemann, U. (2012). An international agreement with full participation to tackle the stock of greenhouse gases. Economics Letters, 115(3), 473–476. https://doi.org/10.1016/j.econlet.2011.12.107
  • La Rovere, E. L., de Macedo, L. V., & Baumert, K. A. (2002). The Brazilian proposal on relative responsibility for global warming. In A. Baumert (Ed.), Building on the Kyoto protocol options for protecting the climate (pp. 157–173). World Resources Institute.
  • McGinty, M. (2006). International environmental agreements among asymmetric nations. Oxford Economic Papers, 59(1), 45–62. https://doi.org/10.1093/oep/gpl028
  • McGinty, M. (2010). International environmental agreements as evolutionary games. Environmental and Resource Economics, 45(2), 251–269. https://doi.org/10.1007/s10640-009-9314-4
  • Miao, D. Y., Huang, W. W., Li, Y. P., & Yang, Z. F. (2014). Planning water resources systems under uncertainty using an interval-fuzzy De Novo programming method. Journal of Environmental Informatics, 24(1), 11–23. https://doi.org/10.3808/jei.201400277
  • Mohr, E. (1995). International environmental permit trade and debt: The consequences of country sovereignty and cross-default policies. Review of International Economics, 3(1), 1–19. https://doi.org/10.1111/j.1467-9396.1995.tb00048.x
  • Mohr, E., & Thomas, J. P. (1998). Pooling sovereign risks: The case of environmental treaties and international debt. Journal of Development Economics, 55(1), 173–190. https://doi.org/10.1016/S0304-3878(97)00061-8
  • Ott, H. E., Winkler, H., Brouns, B., Kartha, S., Mace, M. J., Huq, S., Kameyama, Y., Sari, A. P., Pan, J., Sokona, Y., Bhandari, P. M., Kassenberg, A., La Rovere, E. L., & Rahman, A. A. (2004). South-north dialogue on equity in the greenhouse: A proposal for an adequate and equitable global climate agreement.
  • Pavone, I. R. (2018). The Paris Agreement and the Trump administration: Road to nowhere? Journal of International Studies, 11(1), 34–49. https://doi.org/10.14254/2071-8330.2018/11-1/3
  • Ringius, L., Torvanger, A., & Underdal, A. (2002). Burden sharing and fairness principles in international climate policy. International Environmental Agreements, 2(1), 1–22. https://doi.org/10.1023/A:1015041613785
  • Roberts, J. T., Parks, B. C., & Vásquez, A. A. (2004). Who ratifies environmental treaties and why? Institutionalism, structuralism and participation by 192 nations in 22 treaties. Global Environmental Politics, 4(3), 22–64. https://doi.org/10.1162/1526380041748029
  • Rose, A., Stevens, B., Edmonds, J., & Wise, M. (1998). International equity and differentiation in global warming policy. Environmental and Resource Economics, 12(1), 25–51. https://doi.org/10.1023/A:1008262407777
  • Rubio, S. J., & Casino, B. (2005). Self-enforcing international environmental agreements with a stock pollutant. Spanish Economic Review, 7(2), 89–109. https://doi.org/10.1007/s10108-005-0098-6
  • Rubio, S. J., & Ulph, A. (2007). An infinite-horizon model of dynamic membership of international environmental agreements. Journal of Environmental Economics and Management, 54(3), 296–310. https://doi.org/10.1016/j.jeem.2007.02.004
  • Schlag, K. H. (1998). Why imitate, and if so, how? Journal of Economic Theory, 78(1), 130–156. https://doi.org/10.1006/jeth.1997.2347
  • Schram Stokke, O. (2004). Trade measures and climate compliance: Institutional interplay between WTO and the Marrakesh accords. International Environmental Agreements, 4(4), 339–357. https://doi.org/10.1007/s10784-004-2471-6
  • Spilker, G., & Koubi, V. (2016). The effects of treaty legality and domestic institutional hurdles on environmental treaty ratification. International Environmental Agreements: Politics, Law and Economics, 16(2), 223–238. https://doi.org/10.1007/s10784-014-9255-4
  • Stranlund, J. K. (2007). The regulatory choice of noncompliance in emissions trading programs. Environmental and Resource Economics, 38(1), 99–117. https://doi.org/10.1007/s10640-006-9058-3
  • Suo, M. Q., Li, Y. P., Huang, G. H., Deng, D. L., & Li, Y. F. (2013). Electric power system planning under uncertainty using inexact inventory nonlinear programming method. Journal of Environmental Informatics, 22(1), 49–67. https://doi.org/10.3808/jei.201300245
  • Taylor, P. D., & Jonker, L. B. (1978). Evolutionary stable strategies and game dynamics. Mathematical Biosciences, 40(1-2), 145–156. https://doi.org/10.1016/0025-5564(78)90077-9
  • Tramel, S. (2016). The road through Paris: Climate change, carbon, and the political dynamics of convergence. Globalizations, 13(6), 960–969. https://doi.org/10.1080/14747731.2016.1173376
  • Victor, D. G. (2004). The collapse of the Kyoto Protocol and the struggle to slow global warming. Princeton University Press.
  • Zhang, N., Li, Y. P., Huang, W. W., & Liu, J. (2014). An inexact two-stage water quality management model for supporting sustainable development in a rural system. Journal of Environmental Informatics, 24(1), 52–64. https://doi.org/10.3808/jei.201400274

Appendix:

Stability analysis for pollution strategies

A1. Dynamic equation

The pollution dilemma of the countries is a long-term repeated procedure. Due to informational and computational (or political) deficiencies, the IEA and the countries may not reach their optimal strategy at the beginning of the game. However, by following an evolutionary process, the IEA and the countries will adjust their strategies to find out the best optimal path. The dynamics of this transformation is captured by the replicator dynamics equationFootnote10. From EquationEqs. (2) and Equation(3), the expected benefits of the overall countries will be as π¯=P.i=1KXiπA(Ei)+(1P)i=1KXiπB(Ei). The countries that choose Ei will have expected benefits π¯(Ei)=P. πA(Ei)+(1P). πB(Ei). The expected benefits of the IEA at time t from EquationEqs. (4) and Equation(5) are as follows: u¯=P.uA+(1P).uB

Then, the replicator dynamic equation of Taylor and Jonker (Citation1978) for the IEA and countries is as follows: (A1) dPdt=P(uAu¯)(A1) (A2) dX(Ei)dt=X(Ei)(π¯(Ei)π¯), i=1,2,3,,K(A2)

Further, EquationEq. (A2) is taken as: (A3) dXidt=Xi.(1Xi).[π¯(Ei)j=1jiKWj.π¯(Ej)], i=1,2,.,K(A3) where Wj=Xj(1Xi),j=1,2,,K,ji,π¯(Ek)=Γ(Ek)+P.[(EkE11).ε+γ.Rk], and k=1,,K.

From EquationEq. (A3), the IEA’s monitoring probabilities (P), reward strategies (γ.Rk) and penalty strategies (EkE11) affect the expected benefits of countries fromEk. However, the evolutionary dynamics depends on the proportion of strategies (Ei). If the proportion of low polluters is high, then the proportion of conformers increases in the population.

A2. Stability analysis for the country’s pollution strategies

For the IEA, suppose, f(P)=dpdt=P(1P)(uAuB)=0, then the results are straight forward. If uA=uB, then any monitoring probability P[0,1] is in equilibrium; if uAuB, then monitoring probabilities P=0 or P=1 are IEA’s strategy in the evolutionary equilibrium; if uA<uB,P=0, then the IEA chooses not to monitor, since the payoff of not monitoring exceeds that of monitoring strategies. If uA>uB,P=1, then the IEA will monitor in the stable state of the population and will receive higher benefits than in the case of not monitoring.

For countries, suppose fi(Xi)=(dXidt)=0 and we get the equation sets as follows: (A4) X1[π¯(E1){X1π¯(E1)+X2π¯(E2)++XKπ¯(EK)}]=0,X2[π¯(E2){X1π¯(E1)+X2π¯(E2)++XKπ¯(EK)}]=0, . . . .Xk[π¯(EK){X1π¯(E1)+X2π¯(E2)++XKπ¯(EK)}]=0.(A4)

Then, the results by combining i=1KXi=1 are as follows; if π¯(E1)=π¯(E2)==π¯(EK) then any X=(X1,X2,,XK) will be in the equilibrium. If π¯(Ei)π¯(Ej), where ij, then the equilibrium strategies of the countries under the evolutionary game will be as (A5) X=(X1,X2,,Xi1,Xi,Xi+1,,XK) =(0,0,,0,1,0,,0)T, i=1,2,,K.(A5)

For the evolutionary dynamics of differential equations, we apply the Jacobin Matrix to examine the stability with a balanced point. For more precision, we denote π¯(Ei) as π¯i to represent the benefits by adopting the Ei strategies. The Jacobian Matrix derived from fi(Xi)=dXidt=0 is written as follows: (A6) J(X)=(f1,f2,,fK)(X1,X2,,XK)=[f1X1 f1X2f1XKf2X1 f2X2f2XK . . .fKX1 fKX2fKXK](A6)

Or J(X)=[(1X1)π¯1j=1KXjπ¯(Ej) X1π¯2 X1π¯3 . X1π¯KX2π¯1 (1X2)π¯2j=1KXjπ¯(Ej) X2π¯3 . X2π¯K . . . . . . . . . . . . .XKπ¯1 XKπ¯2 XKπ¯3 (1XK)π¯Kj=1KXjπ¯(Ej) ]

By substituting the equilibrium of Xi=(0,0,,1,,0)T in the Jacobian Matrix, we get the Eigen values as follows: (A7) λi1=π¯1π¯i,,λii=π¯i,λi,j+1=π¯i+1π¯i,,λiK=π¯Kπ¯i.(A7)