Showing 1–3 of 3 results for author: de Carvalho, G H

  1. arXiv:2407.18953  [pdf, other]

    cs.HC

    Evaluating Front-end & Back-end of Human Automation Interaction Applications: A Hypothetical Benchmark

    Authors: Gonçalo Hora de Carvalho

    Abstract: Human Factors, Cognitive Engineering, and Human-Automation Interaction (HAI) form a trifecta, where users and technological systems of ever-increasing autonomous control occupy a centre position. But with great autonomy comes great responsibility. It is in this context that we propose metrics and a benchmark framework based on known regimes in Artificial Intelligence (AI). A benchmark is a set of…

    Submitted 12 July, 2024; originally announced July 2024.

  2. arXiv:2407.11068  [pdf, other]

    cs.AI cs.CL

    Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay

    Authors: Gonçalo Hora de Carvalho, Oscar Knap, Robert Pollice

    Abstract: We explore the hypothesis that LLMs, such as GPT-3.5 and GPT-4, possess broader cognitive functions, particularly in non-linguistic domains. Our approach extends beyond standard linguistic benchmarks by incorporating games like Tic-Tac-Toe, Connect Four, and Battleship, encoded via ASCII, to assess strategic thinking and decision-making. To evaluate the models' ability to generalize beyond their t…

    Submitted 18 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2208.07143  [pdf, other]

    cs.AI math.LO

    C-Causal Blindness: An experimental computational framework on the isomorphic relationship between biological computation, artificial computation, and logic using weighted hidden Markov models

    Authors: Gonçalo Hora de Carvalho

    Abstract: This text is concerned with a hypothetical flavour of cognitive blindness referred to in this paper as C-Causal Blindness or C-CB. A cognitive blindness where the policy to obtain the objective leads to the state to be avoided. A literal example of C-CB would be Kurt Gödel's decision to starve for "fear of being poisoned" - take this to be premise A. The objecti…

    Submitted 19 September, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

    Comments: Changes to experimental methodology and general rewrite