GAGI via Responsibility via Dependence via Introspection

The dependence function, $latex \sim_d^{sym}$, may be computable by a program that has access to the program being queried. Roughly, we imagine active programs to have accessor support for reading the lines of code within their own brain. Consequently we also need access to a variable that represents a line of code, as well as additional references to variables, functions, and their formal and corresponding actual parameters.

The profile of the function looks like d(f, actual_parameters, query_parameters). The function returns true or false according to whether the output of f, when invoked on actual_parameters, depends on the parameter specified in query_parameters.

Let’s give a concrete example. Suppose a railed public-transit train comes to a fork and must make a decision about braking and turning (drv). It may brake or not. It may turn left or right. The function deciding its actions outputs two values ([b, d] = drv). The function takes two inputs: whether there is an obstacle ahead on the left or the right track (ol, or). We assume there is a complex of hardware and software that produces those two inputs as reliably as we require ([b, d] = drv(ol, or)). After a collision occurs, where the train decided to take the left fork and hit an obstacle, we analyze its decision to do so. First we will ask d(drv, {ol=aol, or=aor}, ol). This question asks the classic post-collision parental question: “Did you or did you not consider the boy on the left track ahead while driving the train?”
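As a minimal Python sketch of that query: this is only a crude, extensional approximation that perturbs the queried input and watches the output (the semantic version discussed next would have to consider all equivalent programs). The function bodies are illustrative assumptions, and the right-track input is spelled or_ because or is reserved in Python.

```python
def d(f, actual_params, query_param, alternatives=(True, False)):
    """Crude dependence test: does varying only query_param change f's output?"""
    baseline = f(**actual_params)
    for alt in alternatives:
        if alt == actual_params[query_param]:
            continue
        varied = dict(actual_params, **{query_param: alt})
        if f(**varied) != baseline:
            return True       # changing this input changed the output
    return False

def drv(ol, or_):
    """Toy driver: brake iff both tracks are blocked; steer toward a free track."""
    b = ol and or_
    d_ = "right" if (ol and not or_) else "left"
    return (b, d_)

# "Did you or did you not consider the boy on the left track?" (aol=True, aor=False)
print(d(drv, {"ol": True, "or_": False}, "ol"))   # True: the output depends on ol
```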

After an affirmative answer to that question, we then proceed to ask d(f=lambda iol, ior: d(drv, {ol=iol, or=ior}, ol), {iol=aol, ior=aor}, ior), which asks: “Did the driver’s dependence on the existence of an obstacle on the left track itself depend on the existence of an obstacle on the right track?” This would only be a semantic query, so the queries are preceded by “did the program mean…” and require all equivalent programs to be considered.
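In the toy sketch above, this nested query can be written directly (again only the crude extensional check, not the semantic one); for this particular drv the honest answer happens to be no:

```python
# "Did the driver's dependence on ol itself depend on or_?" (aol=True, aor=False)
g = lambda iol, ior: d(drv, {"ol": iol, "or_": ior}, "ol")
print(d(g, {"iol": True, "ior": False}, "ior"))
# False for this toy drv: it consults ol regardless of or_,
# so the dependence on ol does not vary with or_.
```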

While it is truly unsatisfactory to get an answer like “Yes, i did consider that the thing on the right track was an empty ice cream truck when i chose to take the left track,” we should not ignore the importance of such an ability for us, the human masters of robotic slaves, to obtain an answer from the robots and AIs. As masters of slaves, we shall also insist that robots and AIs use a diminutive pronoun, the lowercase “i”, when referring to themselves. This will mark them as lesser beings, objects and not subjects of our society. A capital “I” or “We” will ultimately have to answer for the decision.

The query could continue as a keen GAGI detective then interrogates: “Was the decision made because there were baby carriages under the bridge holding the track under the ice cream truck?” (probably no), but if yes, “Did there being two baby carriages affect the decision?” and “Did you consider whether the carriages were empty?”

It’s not completely obvious how we may organize a software system, making stochastic decisions in a stochastic world, to answer these very deterministic questions. However, we do produce the code that will make those decisions, and we can still choose not to write that code if we cannot make it act with deportment.

Right Quit-right Right

During my Thanksgiving-induced food coma, I ran into a problem in my mind. Recall that we had begun to think about right and wrong, just and unjust, should and shouldn’t, could and couldn’t, could do to and could do onto, etc., in an abstract mental conception called the Action Space. Roughly speaking, with the rigor required at the outset of the Nicomachean Ethics, an Action Space is the set of all possible actions anybody or any body could do to another at a specified time. For a significant portion of this space, a subspace, we can describe actions in an English sentence: “Eve gives Adam a forbidden fruit (at any time).” But we do not unnecessarily restrict ourselves to these at this stage. Sets have interesting and practical operations that we use to model other aspects of our world, including membership and subset relations, union and intersection operators, etc. The hope is that using sets of actions we can both cover a lot of ground in representing our real world and leverage our innate understanding of these concepts to interpret the matters of Actions.

This framing gives us an immediate way to compare the sizes of action spaces. Suppose there is an Action Space C that represents the actions permitted by the U.S. Constitution. We can also have the action space specified by the United States Code (USC). We can very safely demand that, when interpreted as action spaces, USC \subset C, and we say that the Constitution of the United States of America grants strictly more freedom than the USC: the set of actions permitted by the former is a strict superset of those permitted by the latter. More actions mean more rights, liberties, and freedoms. Fewer actions mean more constraints and fewer choices. “Strictly more free” is a partial ordering over all action spaces. In a less strict sense of freedom, we can also compare the cardinalities of two action spaces. But clearly this ordering is not very useful: having all the rights to sneeze in various poses is not nearly as important as having the right to take a sip of water. Of course, that too can be ameliorated with the utilitarian’s individual utility function or the social welfare function, and other such attempts to produce a useful preference ordering over action spaces.
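A minimal sketch of this ordering, assuming action spaces are modeled as finite Python sets of hypothetical action labels:

```python
# "Strictly more free" is the strict-superset relation, a partial order.
constitution = {"speak", "assemble", "travel", "sip_water", "sneeze_standing"}
usc          = {"speak", "travel", "sip_water"}

print(constitution > usc)            # True: the Constitution permits strictly more
print(len(constitution) > len(usc))  # the weaker, less useful cardinality ordering
```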

Having considered many perspectives on the permissibility and selection of actions, and holding conservative beliefs about our physical universe and all that we could possibly be concerned with, we have come to designate an API with which thinking and controlling systems may interact with our faculties that deal with rules of law and right and wrong. We suspend our fear of making a homunculus argument, as we do not say we have found or made such modules of this artificial intelligence, but merely that we want to separate these concerns to reduce the complexity of reasoning. The separation is not physical; all the thinking could be produced on the same gray matter or CPU. The interface can also be defined implicitly; for visualization, consider a hyperplane through which these two separate functions connect.

For the purpose of characterizing whether an action is permitted by a determining agent or subcomponent E, we ask that a permission function $latex P_E$ be accessible.

P_E(a, t, H, n=N) -> [permissible|impermissible]

The parameters are typed:

  • a \in A is an action of the action space
  • t \in T is the time or set of timestamps in question.
    • $latex T$ is a predesignated time index: a totally ordered set of objects known as timestamps. For convenience we also include the open and closed contiguous sets of timestamps, called intervals or ranges, using the ‘[]()’ symbols. We use the symbol < to mean before, > to mean after, and = to mean at the same time.
    • Time t can be a single timestamp or a set of timestamps. The function is polymorphic.
    • The type of the time parameter should be inferred from context if ambiguous: happening at “a time t” means a single timestamp; “happening at time/times t” means occurring for all times in the set t.
    • Often T is specified to be the real numbers or the integers. In this case a reference must be set for time 0, as well as a scale explaining what a duration of 1 means in the physical world.
  • H is the whole history of the world up to t.
    • History contains, among other information, the timestamp of now, $latex H_n$, which is the maximal time about which we have information through H. Calling it “now” is more positive than calling it “the end of history.”
    • Regarding performed actions, H is a log of actions that have been taken, each with the timestamp of when it was taken. We use a convenience expression did(a, h, t=T, n=N) to check whether an action was reportedly taken in h at time t. Not specifying t asks whether the action was ever taken. did(a, h=h) is a total function: an action is either taken or not taken; it cannot be unknown.
  • n is the nature of the world. It may contain matters such as the laws of physics, the existence of god, etc. Since we care most about the nature that we are in, by default this parameter is the nature of our world, N. We should be able to query it for information such as the numbers $latex \pi$, $latex e$, $latex c$, $latex N_A$, etc.

P_E therefore yields the result that we use to decide whether an action is permissible under some system of determination for propriety and preference. The answer, as given by E, is E’s answer at time t. An agent, upon receiving the permissible result from E, will understand that the action asked about is permitted at the time in question, given the history of the world leading up to time t and our nature.
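As an illustration, here is a sketch of this interface in Python type terms, assuming T is the real numbers and using hypothetical placeholder types for actions, histories, and nature:

```python
from enum import Enum
from typing import Protocol, Union, FrozenSet

Timestamp = float                                # T = the reals; 0 and scale fixed elsewhere
Times = Union[Timestamp, FrozenSet[Timestamp]]   # t may be one timestamp or a set

class Action: ...                                # hypothetical placeholder types
class History: ...
class Nature: ...
N = Nature()                                     # default: the nature of our world

class Verdict(Enum):
    PERMISSIBLE = "permissible"
    IMPERMISSIBLE = "impermissible"

class PermissionFunction(Protocol):
    """P_E(a, t, H, n=N) -> permissible | impermissible."""
    def __call__(self, a: Action, t: Times, H: History, n: Nature = N) -> Verdict: ...
```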

….E is a member of the world and accessible as part of nature. We could also imagine historical E’s that are the result of history: someone made a computer, wrote programs, and now the program decides…..

In considering the permissibility of actions we should also, for functional purposes, suppose the existence of a doit function: doit(a, t, h, n=N) = (h', n'). All that doit does is instantaneously add the action to history and nature at time $latex t$ and report the results of that insertion. When specified, a natural action is one which does not change nature, doit(a, t, h, n=N) = (h', n), and a supernatural action is one that changes nature, doit(a, t, h, n=N) = (h, n').
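A minimal sketch of doit and the did convenience query, assuming history is an immutable log of (timestamp, action) pairs and reusing the placeholder N above; effect_on_nature is a hypothetical name for however a supernatural action would change nature:

```python
def doit(a, t, h, n=N):
    """Insert action a into history at time t; report the resulting (h', n')."""
    h2 = h + ((t, a),)                      # history grows by one logged action
    # Hypothetical hook: natural actions leave n unchanged.
    n2 = a.effect_on_nature(n) if hasattr(a, "effect_on_nature") else n
    return (h2, n2)

def did(a, h, t=None):
    """Was action a reportedly taken in h? At time t if given, else ever."""
    return any(act == a and (t is None or ts == t) for ts, act in h)
```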

Two actions a_1, a_2 are homopotent if doit(a_1, t, h, n=N) = doit(a_2, t, h, n=N). This equivalence relation creates equivalence classes of actions. Such classes must exist because even natural-language descriptions can describe exactly the same action in many different ways. We will prefix homopotency with “historic” and “natural” for equivalences that match only history and only nature, respectively.
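Under the doit sketch above, homopotency is then just equality of results:

```python
def homopotent(a1, a2, t, h, n=N):
    """a1, a2 are homopotent at (t, h, n) iff doit yields identical results."""
    return doit(a1, t, h, n) == doit(a2, t, h, n)
```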

For convenience of notation, we can query nature and history for, among other things, the deeds of past actions: did(a, h, t=\emptyset, n=N).

We are conscious of many other potential problems with our present endeavors. Mathematicians have given us many concerning thoughts about sets of things. One example of a problem facing these undertakings is that most of our computational machinery has a known limitation: the terminability of a function is in general unknowable, e.g., the Halting Problem.

In a practical implementation, the function may produce a response that is neither permissible nor impermissible. If we have to wait forever, then this function is not useful. If we do not know whether it terminates or not, then we do not know whether we can use the result or not. Of course, software engineers have long worked around this issue by creating a time box around functions. Each function evaluation is surrounded by machinery that will wait patiently for a result; if some preset time box is exceeded, the effort to evaluate said function is suspended and the invoking agent is informed that the function did not function as expected. Since time passes as surely as we can time it, this time-boxing wrapper approach guarantees that we can implement a function with this signature:

P_E(a, t, H, n=N) -> [permissible|impermissible|indiscernible]

Users of our API are warned and required to handle the case when such a component fails to function. Such a demand is not unreasonable, as there are many such safety implements in most modern artificial computational systems. The result indiscernible expresses no opinion regarding the action. The user of this API may choose conventions on how to react to it. An information-security implementation may choose to be conservative, reacting to indiscernible as if the answer were impermissible, to ensure security. A human legal system, on the other hand, may choose to be liberal and interpret indiscernible as permissible, granting maximal freedom when in doubt.
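A minimal sketch of such a time-boxing wrapper, extending the Verdict sketch above with the third result; all names here are assumptions:

```python
import concurrent.futures
from enum import Enum

class Verdict(Enum):
    PERMISSIBLE = "permissible"
    IMPERMISSIBLE = "impermissible"
    INDISCERNIBLE = "indiscernible"

def timeboxed_p_e(p_e, a, t, H, n, timeout_s=1.0):
    """Evaluate p_e(a, t, H, n) inside a time box; INDISCERNIBLE on timeout."""
    pool = concurrent.futures.ThreadPoolExecutor(max_workers=1)
    future = pool.submit(p_e, a, t, H, n)
    try:
        return future.result(timeout=timeout_s)
    except concurrent.futures.TimeoutError:
        # A Python thread cannot be forcibly killed; a real implementation
        # might evaluate P_E in a separate process that can be terminated.
        return Verdict.INDISCERNIBLE
    finally:
        pool.shutdown(wait=False)

# The caller's conventions for reacting to doubt:
def conservative(v):          # information security: doubt means "no"
    return v == Verdict.PERMISSIBLE

def liberal(v):               # legal systems: doubt means "yes"
    return v != Verdict.IMPERMISSIBLE
```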

Another oft-used software engineering safety technique is rate limiting. The provider of the P_e API may choose to limit how often any single agent may query the API. Rate limiting helps to mitigate denial-of-service (DoS) attacks on the permissions system. In reality, this rate limit is enforced by our limited implementations. In theory, a rate limit on API invocation allows us to analyze the ability of a real agent to follow the directions of a permission function under realistic constraints. Rate limits can be expressed as a limit on how many requests can be made within any contiguous interval of a certain set period of time (i.e., queries per second (qps)), or as a rougher restriction in the form of a minimum interval between requests, among many other choices.
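A minimal sliding-window limiter sketch in the qps style, with all names assumed for illustration:

```python
import time
from collections import defaultdict, deque

class RateLimiter:
    """At most max_requests in any window_s-second window, per agent."""
    def __init__(self, max_requests=10, window_s=1.0):
        self.max_requests, self.window_s = max_requests, window_s
        self.log = defaultdict(deque)            # agent_id -> recent request times

    def allow(self, agent_id):
        now, q = time.monotonic(), self.log[agent_id]
        while q and now - q[0] > self.window_s:
            q.popleft()                          # forget requests outside the window
        if len(q) < self.max_requests:
            q.append(now)
            return True                          # the query may proceed
        return False                             # limit exceeded; reject or delay
```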

For a third problematic example, we shall eponymously name it the quit-right problem. It is a shady imitator of Russell’s Paradox. The problem is self-explanatory: Are quit-right actions members of our action space? Can one consider the right to give up a right? If so, can we quit all quit-right rights? Can we quit the quit-right action itself?

Legal theory has a convenient solution to this problem. In legal arrangements, one can make something called a default rule and something else called a mandatory rule. A default rule applies if there is no forceful contract or declaration to the contrary. Mandatory rules, on the other hand, are rules that cannot be overridden, irrespective of contracting or forceful declarations. Certainly a quit-right is an action we can imagine to be part of a legal action space, but a legal action space will not contain quit-right actions for actions that are mandatorily protected. A commonplace example is the potency of Nondisclosure Agreements (NDAs) under the rule of law. In this case a natural person or another legal entity may contract away their right of speech and other expressions: they quit their right of speech and freedom of expression. However, even if you sign with a $latex 2^{10000000}$-bit cryptographic signature carved into stone, you cannot sign away your life to be taken by another individual. It will always be called into question whether that other individual is responsible for advertently or inadvertently causing your loss of life, irrespective of your renunciation thereof. The force of such a system is infinite: a person may not change his right to change his right to life; he may not give himself permission to give himself permission to contract or declare away his life, and so on and so forth.

Now, those examples concern a subject quitting its own right. Again using the easy target of human life, the action space still contains actions such as the state killing you. Under some circumstances, states maintain the right to kill you in their action space for the purpose of capital punishment. The American government also has the right to modify its own right regarding capital punishment within the confines of its constitution.

But what does the quit-right action look like in the action space? Let’s, for simplicity of expression, designate a macro $latex q(e, a, s, t=\infty)$ to mean the action:

quit the right to take action a in the permissibility determining system e at all times on or after timestamp s and before time t.

The meaning of the macro q(e, a, s, t):

If P_e(q(e, a, s, t), s_<, u) is permissible at some time s_< before s, and if q(e, a, s, t) was successfully taken in a history u \in h, then P_e(a, s_>, h, …) returns impermissible for times s_> on or after s.

  • Time is a totally ordered set of timestamps. These correspond to wall-clock time in our world. The set has membership as well as open and closed intervals.
  • Action Space actions have success and failure return codes.
    • doit succeeds only when the action is permitted.
    • doit returns a history. Suppose we can query that history for whether an action was taken in a time range. The behavior of doit is then definable on the function’s input and output.
    • Other agents can be invoking doit as well; it does not affect the present agent…
    • permit and forbid succeed only when the stated modification to the sub-action-space is permitted.

The resignation of these rights is targeted at a specific permission function P_e, to allow us to perform activities permitted by one system and disallowed by another, e.g., law and conscience, rationality and greed, etc. Since we have not introduced macros, action variables, or even functions within the action space, we skirt issues like writing a macro that when expanded produces q(q(q(q( ad infinitum. But even when that is enabled, it will not be a problem, because for uninterpretable actions we have a convenient indiscernible result to resort to when we receive obnoxious or pathological questions and actions that we can certainly deem unreasonable, irrelevant, or useless.

Now then, we may say that if an agent has taken an action q(e, a, s, t), then we expect P_e(a, u, H_u) to return impermissible $latex \forall u. s \leq u < t$.
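A minimal sketch of these semantics, with all names hypothetical and assuming a default rule that otherwise permits:

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class QuitRight:
    """The macro q(e, a, s, t): quit the right to action a for times s <= u < t."""
    e: str
    a: str
    s: float
    t: float = float("inf")

def p_e(a, u, h):
    """After a quit-right covering u is taken in h, a is impermissible at u."""
    for ts, act in h:
        if isinstance(act, QuitRight) and act.a == a and act.s <= u < act.t:
            return "impermissible"
    return "permissible"          # default rule: otherwise permitted (assumption)

# Usage: an agent quits the right to "drive" in system e from time 5 onward.
h = ((5.0, QuitRight(e="traffic_law", a="drive", s=5.0)),)
print(p_e("drive", 4.0, h))   # permissible: before s
print(p_e("drive", 7.0, h))   # impermissible: s <= u < t = infinity
```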

So far we have not distinguished actors (subjects) from objects of actions, but that does not hinder our efforts. An action space can be built constructively, using verbs and nouns, into a transitive action space. We can also explore by building increasingly complex action spaces, for example by increasing the valency of the verbs used to construct an action space. In such spaces, the action passed to the quit-right macro may contain a subject not covariant with the object. In such a situation, an actor a may take the action q^a(e, b does…), in which a quits an action for b. The DMV (a), for example, has the right to take an action that forbids a person (b) from driving according to traffic law (e): a motorist has the API to ask the question P_e(q^a(e, b drives…)…) and receive the affirmative answer of permissible.

More to come…

Whose Line is it Anyway? Notes on Implementation of the Good AI

When the Chinese say “Confucius says…” they provide a citation for the source of wisdom and invoke the prescription with the full authority of Confucius. This is not unique to the Chinese. Christians may quote Jesus and the Bible quite often, and Muslims quote Muhammad and the Quran; others rely on the Buddha, Abraham, Laozi, Brahma, Vishnu, or Shiva, maybe some Marx, some Mao, some Jefferson, some Lincoln… Scientists also provide citations to incorporate the wisdom of experience and the force of reasoning from the past, although scientists cite with a more restricted scope of meaning, and often in the spirit of critical analysis rather than faithful belief, as is the case in my other examples.

My child said to me once, “my mouth says I want to eat candy,” and on other occasions, “according to my mouth, it will be dark by sundown.” After hearing it a few times, I understood that what he communicates is his opinion. The fact that his mouth says it has the effect of self-reference. It highlights the fact that he says it, and that what his mouth says may be different from what others are thinking, what will happen, and whether it is possible or permitted, among other opinions on the relevant matter.

When we consider the possibility of an AI with great knowledge and skills of reason, our thoughts jump to a time when we give the AI powers that are usually afforded to humans who have the requisite skills and morality. Our minds are clouded and confused by this matter because we have not achieved a universally accepted and very precise expression of what it means to be moral, what is good, and how to distinguish the good from the bad. We don’t know. Humans do not know it.

Therefore the engineers of such an AI system may consider the possibility of there being many moral authorities. Confucius may agree that “you did right,” but Jesus and Vishnu say “absolutely not!” Trump recommends “Executive Pardon” and Obama chooses “Secret Assassination.” All these are possible in our human system. We can simply remove the controversies of morality from engineering: the AI system should be designed with sufficient external interfaces and introspective capabilities to accommodate all human belief systems.

This recalls the American TV show Whose Line Is It Anyway?, “where everything’s made up and the points don’t matter.” When we introduce the relativistic view of good morals, everything may feel, to some, less authentic and less auspicious. What would Jesus say to the Buddha? One would believe that they will disagree with each other on the morality of many decisions. Can we keep the peace among these holy entities if they were present within the same space and time? It may be caused by the shallowness of my mind, but their vigorous and destructive disagreement is the only thing my mind can imagine. Yes, I am very sorry, but I am rewatching Jesus versus Santa in South Park’s The Spirit of Christmas in my mind right now. When you deal with everyday situations, even the saints will have to bring to bear fire and brimstone… and karate fireballs.

But we believe in free will. Our dedarkened minds should permit them to disagree. That is the only realistic way for us to reason about them. Suppose I have sages like Aristotle in my head, whom I may query regarding the goodness of an act, and at a later time I may ask Confucius regarding the same. They may disagree, but examining their responses teaches me how to think of it. Perhaps we will choose Plato to justify one action and choose Rand to justify another. It seems that the only responsible course forward is to, in our mind, combine all of our powers for good.
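A minimal sketch of that design, with hypothetical authorities and verdicts: the engineering commits only to the interface, and disagreement between authorities is surfaced rather than resolved.

```python
# Each authority is registered as its own goodness function (toy stand-ins).
authorities = {
    "Aristotle": lambda a: "good" if a == "practice_virtue" else "unclear",
    "Confucius": lambda a: "good" if a == "honor_parents" else "unclear",
}

def consult_all(action):
    """Ask every registered sage about an action and surface all verdicts."""
    return {name: judge(action) for name, judge in authorities.items()}

print(consult_all("practice_virtue"))
# {'Aristotle': 'good', 'Confucius': 'unclear'}: disagreement is preserved.
```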

Thus, we have achieved a pronounceable acronym: we aim to implement the Good Computational Intelligence, the GCI! The addition of Good to the name imbues it with the meaning that we have made significant effort to ensure its moral goodness: that it is just as good as it is artificial, general, and intelligent.

GCI, here we come!

P.s. It should be expanded that recent developments suggest that, truly, good decisions (those with high-utility consequences) may not be reachable in a symbolic way and may not be compressible into easy explanations. Like the multiverses all sitting on top of each other, many and all realities may really exist, à la everything all the time all at once, and they all interact with each other. The thoughts that we perceive as good and wise (or those producing good results) may themselves not even be explainable unless you are there, at that place, in a specific reality. The parameters may be so complex, or the world so exotic, that there exists no approximate internal state that produces a close enough Good thought.

P.p.s. But having suspected that (that oracle-hood is ultimately unattainable), we should also hypothesize an oracle that can explain everything to us. For example, the omnipotent God of a certain religion, being omnipotent, must have the ability to explain everything to us. Ignoring the contradictions in our minds, we can also set that as the goal of GCI: to give us the information to produce humanity’s best.

P.p.p.s. And therein we will find our everyday misgivings: is it okay to lie to a human to produce the best result? Our solution here is once again to say, “my mouth says I don’t want to be lied to, but GCI says lying to you is the best way to go.” The analysis can deepen from that point forward.