Social:Fair random assignment

From HandWiki

Fair random assignment (also called probabilistic one-sided matching) is a kind of a fair division problem. In an assignment problem (also called house-allocation problem or one-sided matching), there m objects and they have to be allocated among n agents. Usually, mn, and each agent has to receive at most one object. Examples include the assignment of jobs to workers, rooms to housemates, dormitories to students, time-slots to users of a common machine, and so on.

In general, a fair assignment may be impossible to attain. For example, if Alice and Batya both prefer the eastern room to the western room, only one of them will get it and the other will be envious. In the random assignment setting, fairness is attained using a lottery. So in the simple example above, Alice and Batya will toss a fair coin and the winner will get the eastern room.

History

Random assignment is mentioned already in the Bible: a lottery was used to allocate the lands of Canaan among the Tribes of Israel (Numbers 26:55).

In the US, lotteries were used to assign public lands to homesteaders (e.g. Oklahoma in 1901), and to assign radio spectra to broadcasters (e.g. FCC 1981-1993). Lottery is still used to assign green cards.[1]

Methods

There are several ways to extend the "coin toss" method to situations in which there are more than two agents, and they may have different preference relations on the objects:[2][3][4]

  • Random Priority (RP, aka Random Serial Dictatorship or RSD) is a very simple mechanism that only requires agents to have ordinal ranking on individual items. It chooses a random priority-ordering on the items and lets each agent in turn pick his favorite remaining item.
  • Probabilistic Serial (PS) is another mechanism that works only with ordinal ranking on items. Agents "eat" their favorite remaining items in a constant speed, and the fraction each agent managed to eat is his/her probability to get this item.
  • Competitive Equilibrium from Equal Incomes (CEEI) is a market-based mechanism: each item is viewed as a divisible commodity. Each agent is given [math]\displaystyle{ 1/n }[/math]-share of each commodity, then the agents are allowed to trade until there is equilibrium.[5] This is a more complex mechanism that requires the agents to have full cardinal utility functions (or, alternatively, ordinal ranking on lotteries).

Properties

One desired property of a random assignment rule is Pareto efficiency (PE). There are two variants of PE:

  • Ex-post PE means that, after the final allocation is determined, no other allocation is better for some agent and at least as good for the others. All three rules above (RP, PS and CEEI) are ex-post PE.
  • Ex-ante PE is a stronger property. It means that no other lottery is better for some agent and at least as good for the others. To define this property, it is important to know how agents compare lotteries. CEEI is ex-ante PE when agents compare lotteries based on their expected utility. PS is ex-ante PE when agents compare lotteries by stochastic dominance (this property is called ex-ante sd-PE). In contrast, RP is not ex-ante PE.

Another desired property is envy-freeness (EF). Again, there are two variants of EF:

  • Ex-post EF means that, after the final allocation is determined, no agent prefers the allocation of another agent. No rule satisfies this strong property; indeed, it may be impossible to find an ex-post EF allocation of indivisible objects.
  • Ex-ante EF means that no agent prefers the lottery of another agent. Again, defining this property requires to know how agents compare lotteries. CEEI is ex-ante EF w.r.t. expected utilities. PS is ex-ante EF w.r.t. stochastic dominance (this property is called ex-ante sd-EF). RP is weakly ex-ante sd-EF; it is EF when agents compare lotteries by lexicographic dominance (ld-EF).[6]

A third desired property is truthfulness (also called strategyproofness). CEEI is not truthful. PS is weakly sd-truthful when there is at most one object per person; in this setting it is also truthful w.r.t. lexicographic dominance (ld-truthful).[6] RP is sd-truthful in this case, and it can be extended in a truthful way also to the general case when there are more objects than people. The following table compares the various rules' properties (the RP and PS columns are based on [7]):

#items ≤ #agents #items > #agents
RP PS CEEI RP PS CEEI
Ex-ante efficiency No sd-PE PE No sd-PE PE
Ex-ante fairness Weak sd-EF;

ld-EF

sd-EF EF Weak sd-EF sd-EF EF
Truthfulness sd-truthful Weak sd-truthful;

ld-truthful

No sd-truthful* No No

Decomposing a fractional allocation

Both the PS and the CEEI rules compute a matrix of expected assignments, i.e., the marginal probabilities with which each agent receives each object. However, since the final allocation must be a matching, one must find a decomposition of this matrix into a lottery on matchings.

In the classic setting, in which m=n, this can be done using the Birkhoff algorithm. It can decompose any n-by-n matrix of agent-object probabilities into a convex combination of O(n2) permutation matrices, each of which represents a matching. However, the decomposition is not unique, and some decompositions may be better than others.

Budish, Che, Kojima and Milgrom[1] generalize Birkhoff's algorithm to arbitrary m and n. They also allow to add constraints on the assignments, under a maximal set of conditions on the set of constraints. They also present a decomposition method that minimizes the variance in the utility experienced by the agents between the different matchings.

Demeulemeester, Goossens, Hermans and Roel[8] present a polynomial-time decomposition algorithm that maximizes the worst-case number of agents who receive an object. Their algorithm guarantees that the worst-case number of agents equals the expected number of agents rounded down, which is the best possible. They present another decomposition algorithm that maximizes the worst-case number of assigned agents while guaranteeing that all matchings in the decomposition be ex-post PE; the second algorithm can be used only for fractional assignments outputted by PS, but not those corresponding to RP. For RP, it is only possible to attain a 1/2-factor approximation to the optimal worst-case number of assigned agents. For general fractional assignments, maximizing the worst-case number of assigned agents subject to ex-post PE is NP-hard. They also present a column generation framework that can be used to optimize other worst-case criteria.

Empirical comparison

Hosseini, Larson and Cohen[7] compare RP to PS in various settings. They show that:

  • When there are at most 2 objects and at most 3 agents, RP and PS return the same allocation.
  • When there are at most 2 objects, for any number of agents, PS is sd-truthful and RP is sd-envy-free, and in most instances, PS dominates RP, particularly with 4 or more agents.
  • When there are 3 or more objects (and 3 or more agents), RP and PS may return different allocations, and no allocation Pareto-dominates the other. For example, suppose there are three objects a,b,c and three agents with preference-rankings (1) a>c>b, (2) a>b>c, (3) b>a>c. Then, to agent (1), both RP and PS give 1/2 a + 1/2 c; to agent (2), RP gives 1/2 a + 1/6 b + 1/3 c while PS gives 1/2 a + 1/4 b + 1/4 c which is stochastically-dominant; and to agent (3), RP gives 5/6 b + 1/6 c while PS gives 3/4 b + 1/4 c which is stochastically-dominated. So (1) is indifferent, (2) strictly prefers PS and (3) strictly prefers RP.
  • The fraction of preference profiles for which PS sd-dominates RP is large when the number of agents and objects differ, but approaches 0 when the numbers are equal. The same is true for ld-domination.
  • When agents are risk-neutral, the expected social welfare of PS is larger than RP, but the difference is substantial only when n≠m. With RP, the fraction of envious agents is near zero when nm. PS is manipulable, and the gain from manipulation increases when m>n.
  • When agents are risk-seeking, the expected social welfare of PS is larger than RP, and the difference grows rapidly when n≠m. In contrast, when n=m RP attains a higher social welfare in most cases. With RP, the fraction of envious agents is near zero when nm, but generates envy when m>n. The envy of RP decreases when risk-seekingness increases. The gain from manipulating PS decreases when agents are more risk-seeking.
  • When agents are risk-averse, the social welfare gap between RP and PS becomes smaller (though still statistically-significant). The fraction of envious agents in RP increases, but the envy remains below 0.01 when nm. The manipulability of PS goes to 1 when m/n grows.

Non-linear utilities

Tao and Cole[9] study the existence of PE and EF random allocations when the utilities are non-linear (can have complements).

See also

  • Rental harmony is a variant of the assignment problem in which fairness is attained using monetary payments, instead of randomization.
  • Fair item allocation is a setting in which agents may get more than one item.
  • Sortition - random selection of political officials.

References

  1. 1.0 1.1 Budish, Eric; Che, Yeon-Koo; Kojima, Fuhito; Milgrom, Paul (2013-04-01). "Designing Random Allocation Mechanisms: Theory and Applications" (in en). American Economic Review 103 (2): 585–623. doi:10.1257/aer.103.2.585. ISSN 0002-8282. https://www.aeaweb.org/articles?id=10.1257/aer.103.2.585. 
  2. Bogomolnaia, Anna; Moulin, Hervé (2001). "A New Solution to the Random Assignment Problem". Journal of Economic Theory 100 (2): 295. doi:10.1006/jeth.2000.2710. 
  3. Yılmaz, Özgür (2009). "Random assignment under weak preferences". Games and Economic Behavior 66: 546–558. doi:10.1016/j.geb.2008.04.017. http://cdm21054.contentdm.oclc.org/cdm/ref/collection/IR/id/6207. 
  4. Katta, Akshay-Kumar; Sethuraman, Jay (2006). "A solution to the random assignment problem on the full preference domain". Journal of Economic Theory 131: 231–250. doi:10.1016/j.jet.2005.05.001. 
  5. Hylland, Aanund; Zeckhauser, Richard (1979). "The Efficient Allocation of Individuals to Positions". Journal of Political Economy 87 (2): 293. doi:10.1086/260757. 
  6. 6.0 6.1 Kate, Hosseini, Hadi Larson (2015-07-24). Strategyproof Quota Mechanisms for Multiple Assignment Problems. OCLC 1106222190. http://worldcat.org/oclc/1106222190. 
  7. 7.0 7.1 Hadi Hosseini, Kate Larson, Robin Cohen (2018). "Investigating the characteristics of one-sided matching mechanisms under various preferences and risk attitudes". Autonomous Agents and Multi-Agent Systems 32 (4): 534–567. doi:10.1007/s10458-018-9387-y. https://link.springer.com/article/10.1007/s10458-018-9387-y. 
  8. Demeulemeester, Tom; Goossens, Dries; Hermans, Ben; Leus, Roel (2021-01-03). "Risk aversion in one-sided matching". arXiv:2101.00579 [cs.DS].
  9. Cole, Richard; Tao, Yixin (2021-04-01). "On the existence of Pareto Efficient and envy-free allocations" (in en). Journal of Economic Theory 193: 105207. doi:10.1016/j.jet.2021.105207. ISSN 0022-0531. https://www.sciencedirect.com/science/article/pii/S0022053121000247.