Skip to main navigation Skip to search Skip to main content

Exploration of the Impact from Early Injection of Symbolic Knowledge into a Language Model

  • Jose Carlos Machicao

    Research output: Chapter in Book/ReportConference contributionpeer-review

    Abstract

    Large Language Models (LLMs) have demonstrated remarkable capabilities but continue to exhibit fundamental limitations in areas humans find intuitive, such as spatial reasoning. This paper investigates whether early injection of symbolic knowledge during training could address these limitations. Unlike post-training alignment techniques, our approach aims to influence how concepts are initially represented within model parameters. Through controlled experiments with a very simple model, word2vec models, we demonstrate that early injection of spatial symbolic knowledge produces qualitatively different representations of spatial concepts, particularly regarding dynamic relationships and potential movements. Models trained with this approach show enhanced understanding of spatial dynamics, capturing not just static positions but causal consequences. While quantitative differences were modest in our small-scale experiment, the qualitative improvements in semantic retrieval suggest promising directions for integrating symbolic knowledge in more complex language models. This work contributes a novel perspective on the timing of symbolic knowledge integration, challenging the prevailing paradigm of large-scale pretraining followed by alignment.

    Original languageAmerican English
    Title of host publicationProceedings of 20th Iberian Conference on Information Systems and Technologies (CISTI 2025) - Volume 2
    EditorsAlvaro Rocha, Francisco García Peñalvo, Carlos J. Costa, Ramiro Gonçalves
    PublisherSpringer Science and Business Media Deutschland GmbH
    Pages801-812
    Number of pages12
    ISBN (Print)9783032107206
    DOIs
    StateIndexed - 2026
    Event20th Iberian Conference on Information Systems and Technologies, CISTI 2025 - Lisbon, Portugal
    Duration: 16 Jun 202519 Jun 2025

    Publication series

    NameLecture Notes in Networks and Systems
    Volume1717 LNNS
    ISSN (Print)2367-3370
    ISSN (Electronic)2367-3389

    Conference

    Conference20th Iberian Conference on Information Systems and Technologies, CISTI 2025
    Country/TerritoryPortugal
    CityLisbon
    Period16/06/2519/06/25

    Bibliographical note

    Publisher Copyright:
    © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.

    Keywords

    • Cognitive Architectures
    • Early Training Intervention
    • Language Models
    • Neural-Symbolic Systems
    • Spatial Reasoning
    • Symbolic Knowledge Integration

    Fingerprint

    Dive into the research topics of 'Exploration of the Impact from Early Injection of Symbolic Knowledge into a Language Model'. Together they form a unique fingerprint.

    Cite this