Search results

Decentralized partially observable Markov decision process
...n and [[decision-making]] among multiple agents. It is a [[probabilistic]] model that can consider [[uncertainty]] in outcomes, sensors and communication (i * <math>A_i</math> is a set of actions for agent <math>i</math>, with <math>A=\times_i A_i</math> is the set of joint action ...

3 KB (513 words) - 00:27, 26 June 2024
Grammar systems theory
...s a formalization of decentralized or distributed systems of [[Intelligent agent|agents]] in [[artificial intelligence]].<ref name="gramsysdis">{{cite thesi Let <math>\mathbb{A}</math> be a simple [[reactive agent]] moving on the table and trying not to fall down from the table with two r ...

5 KB (712 words) - 19:22, 9 January 2023
Empowerment (artificial intelligence)
...ence its environment.<ref name=klyubin2005a /><ref name=klyubin2005b /> An agent which follows an empowerment maximising policy, acts to maximise future opt ...the [[Motor cognition#Perception-action coupling|perception-action loop]]. Agent state and actions are modelled by random variables (<math>S: s \in \mathcal ...

6 KB (912 words) - 16:03, 21 November 2024
Action model learning
...within its ''environment''. This knowledge is usually represented in logic-based [[action description language]] and used as the input for [[automated plann Learning action models is important when goals change. When an agent has acted for a while, it can use its accumulated knowledge about actions in th ...

7 KB (968 words) - 15:22, 24 February 2025
Inverse planning
'''Inverse Planning''' refers to the process of inferring an agent's mental states, such as goals, beliefs, emotions, etc., from actions by as ...nverse Reinforcement Learning]], which attempts to learn a reward function based on agents' behavior, and [[Activity recognition|plan recognition]], which f ...

7 KB (940 words) - 09:22, 11 November 2024
Social cognitive optimization
...rithm which was developed in 2002.<ref name="xzy02sco"/> This algorithm is based on the [[social cognitive theory]], and the key point of the ergodicity is ...cognitive agents solving in parallel, with a social sharing library. Each agent holds a private memory containing one knowledge point, and the social shari ...

6 KB (967 words) - 00:32, 10 October 2021
Maximum score estimator
...there is an additive [[errors and residuals|response error]]. Then for an agent <math> t \in T </math> , ...> \beta </math> which characterizes the effect of different factors on the agent's choice. ...

11 KB (1,690 words) - 23:22, 29 June 2021
Schelling's model of segregation
{{short description|Agent-based segregation model}} ...rst=Junfu | title=Tipping and residential segregation: a unified Schelling model | journal=Journal of Regional Science | publisher=Wiley | volume=51 | issue ...

13 KB (1,810 words) - 03:48, 10 February 2024
Incomplete information network game
...ors; take their action based on their prior belief and update their belief based on the history of the game.<ref>Song Y. and M. van der Schaar (2015) “Dynam ...ximates the distribution over a neighbors' degree from the [[configuration model]] with respect to a [[Degree (graph theory)|degree sequence]] represented b ...

9 KB (1,450 words) - 21:12, 9 October 2023
Exploration–exploitation dilemma
...en two opposing strategies. Exploitation involves choosing the best option based on current knowledge of the system (which may be incomplete or misleading), ...(2nd edition). http://incompleteideas.net/book/the-book-2nd.html</ref> The agent must decide whether to exploit the current best-known policy or explore new ...

14 KB (2,047 words) - 16:55, 29 January 2025
Abstract economy
...d model of an [[exchange economy]] in [[microeconomics]], and the standard model of a game in [[game theory]]. An ''equilibrium'' in an abstract economy gen ...Walrasian equilibrium (aka competitive equilibrium) in the [[Arrow–Debreu model]].<ref name=":1">{{Cite journal|last1=Arrow|first1=Kenneth J.|last2=Debreu| ...

19 KB (3,086 words) - 06:51, 17 January 2025
Mountain car problem
...ves a negative reward at every time step when the goal is not reached; the agent has no information about the goal until an initial success. ...ew Moore's PhD thesis (1990).<ref>[Moore, 1990] A. Moore, Efficient Memory-Based Learning for Robot Control, PhD thesis, University of Cambridge, November 1 ...

9 KB (1,230 words) - 13:36, 11 November 2024
Blackwell's informativeness theorem
...[Blackwell's informativeness theorem#Garbling|informativeness]]'', and one based in ''[[Blackwell's informativeness theorem#Feasibility|feasibility]]''. Thi ...di |last2=Safra |first2=Zvi |author1-link=Edi Karni |title=Hybrid decision model and the ranking of experiments |journal=Journal of Mathematical Economics | ...

10 KB (1,492 words) - 05:16, 11 December 2024
Truthful resource allocation
== Model == There are ''n'' agents. Each agent has a function that attributes a numeric value to each "bundle" (combinatio ...

13 KB (1,842 words) - 03:34, 16 January 2025
Natural borrowing limit
In the standard consumer utility maximization problem of the [[economic agent]], she maximizes utility by consuming goods. In making an optimal consumpti ...g constraint.<ref>Nakajima, Makoto, 2007. "Note on the Heterogeneous Agent Model: Aiyagari (1994)" [http://www.compmacro.com/makoto/note/note_im_aiyagari.pd ...

13 KB (2,095 words) - 18:46, 13 March 2023
Random utility model
...-4076(75)90032-9 |title=Maximum score estimation of the stochastic utility model of choice |date=1975 |last1=Manski |first1=Charles F. |journal=Journal of E ...able. Given that state, the agent behaves rationally. In other words: each agent has, not a single preference-relation, but a [[Probability distribution|''d ...

17 KB (2,394 words) - 22:21, 26 January 2025
Bayesian-optimal pricing
...nd of [[algorithmic pricing]] in which a seller determines the sell-prices based on probabilistic assumptions on the valuations of the buyers. It is a simpl ...rtunately, the seller does not know the buyer's valuation. In the Bayesian model, it is assumed that the buyer's valuation is a [[random variable]] drawn fr ...

18 KB (2,825 words) - 11:27, 9 December 2024
Draft:Random utility model
...|date=1975-08-01 |title=Maximum score estimation of the stochastic utility model of choice |url=https://dx.doi.org/10.1016/0304-4076%2875%2990032-9 |journal ...able. Given that state, the agent behaves rationally. In other words: each agent has, not a single preference-relation, but a [[Probability distribution|''d ...

19 KB (2,642 words) - 10:48, 26 December 2024
WARP (systolic array)
...delivered in June 1986. The first of the significantly redesigned production model, the PC-Warp, was delivered by G.E. in April 1987. About twenty production One PE consists of two main agents: a Computation Agent and a Communication Agent.<ref>{{Cite journal |last1=Borkar |first1=S. |last2=Cohn |first2=R. |last3= ...

8 KB (1,187 words) - 05:45, 10 December 2024
Truthful cake-cutting
...l randomized truthful mechanism for [[fair cake-cutting]]: select a single agent uniformly at random, and give him/her the entire cake. This mechanism is tr ...us division'') is a partition of the cake into ''n'' pieces such that each agent values each piece at exactly 1/''n''. The existence of such a division is [ ...

27 KB (3,855 words) - 01:02, 16 January 2025
