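한국기록관리학회지 (Journal of Korean Society of Archives and Records Management), 한국기록관리학회 (Korean Society of Archives and Records Management)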

1

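박하람 (Library and Information Science major, Department of Library and Information Science, Graduate School, Chung-Ang University) ; 김학래 (Chung-Ang University) 2021, Vol.21, No.3, pp.61-78 https://doi.org/10.14404/JKSARM.2021.21.3.061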

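Abstract (translated from the Korean)

Records on the Japanese military “comfort women” are managed separately by private institutions. Some of these records have been built into digital archives and can be accessed online. However, the composition and representation of metadata in these digital archives differ from institution to institution. Moreover, because there is no adequate scheme for defining relationships between records, the existing records on the Japanese military “comfort women” remain unconnected and fragmented. This study proposes a knowledge model for linking digital records on the Japanese military “comfort women” and builds a knowledge graph by integrating records from distributed digital archives. It analyzes the metadata of these digital archives to derive common elements and applies standard vocabularies to semantically express the various entities in the digital records and the relationships between them. In particular, the collected data are refined so that scattered records can be linked and searched, and external data are used to enrich the contextual information of the records. The knowledge graph is validated through queries that measure whether distributed records can be explored. The validation shows that the knowledge graph can link and retrieve scattered records, provides rich contextual information through enrichment with external data, and supports accurate searches matched to the user’s intent through semantics-based search.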

Abstract

Records on the Japanese military “Comfort Women” have been managed individually by private institutions, and some are provided as digital archives on the Internet. However, the composition and representation of metadata in these digital archives differ from institution to institution. Moreover, there is no consistent structure for describing the relationships among these records, which leaves them fragmented and disconnected. This paper proposes a knowledge model for interlinking the digital archival resources and builds a knowledge graph by integrating the records from distributed digital archives. It derives common elements by analyzing metadata from the diverse digital archives and expresses them in standard vocabularies to semantically describe the multiple entities and relationships of the digital archival resources. In particular, the study refines the collected data so that dispersed records can be searched and linked, and enriches the records with external data to provide contextual information. The knowledge graph is evaluated through queries that measure the (dis)connectivity between the distributed records. The evaluation shows that the knowledge graph can interlink and retrieve fragmented records, provide substantial contextual information through external data enrichment, and support accurate searches that match the user’s intentions through semantic-based queries.
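The integration step described in this abstract can be illustrated with a minimal sketch. It is not the authors' model: the two archives, their field names, the shared property names, and the sample records below are all hypothetical, chosen only to show how records with differing metadata might be mapped onto common elements and then queried across archives.

```python
from collections import defaultdict

# Hypothetical records from two archives that use different field names.
ARCHIVE_A = [{"id": "a1", "title": "Testimony 1", "creator": "Person X"}]
ARCHIVE_B = [{"identifier": "b7", "name": "Photograph 3", "author": "Person X"}]

# Hypothetical mapping from each archive's fields to shared property names.
FIELD_MAPS = {
    "A": {"id": "identifier", "title": "title", "creator": "creator"},
    "B": {"identifier": "identifier", "name": "title", "author": "creator"},
}


def to_triples(source, records):
    """Convert one archive's records into (subject, predicate, object) triples."""
    mapping = FIELD_MAPS[source]
    triples = []
    for record in records:
        # Rename the archive-specific fields to the shared property names.
        common = {mapping[k]: v for k, v in record.items() if k in mapping}
        subject = f"{source}:{common['identifier']}"
        for predicate, value in common.items():
            if predicate != "identifier":
                triples.append((subject, predicate, value))
    return triples


def records_by_creator(triples):
    """Index record identifiers by creator, regardless of source archive."""
    index = defaultdict(set)
    for subject, predicate, obj in triples:
        if predicate == "creator":
            index[obj].add(subject)
    return index


# Merge both archives into one small graph and query across them.
graph = to_triples("A", ARCHIVE_A) + to_triples("B", ARCHIVE_B)
linked = records_by_creator(graph)
```

Here `linked["Person X"]` contains records from both archives, which is the kind of cross-archive retrieval the evaluation queries are said to measure.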

2

Evaluation of a Knowledge Graph for the Interoperability of Digital Records

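박하람 (Ph.D. student, Library and Information Science major, Department of Library and Information Science, Graduate School, Chung-Ang University) ; 김학래 (Professor, Department of Library and Information Science, College of Social Sciences, Chung-Ang University) 2023, Vol.23, No.4, pp.159-178 https://doi.org/10.14404/JKSARM.2023.23.4.159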

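Abstract (translated from the Korean)

A digital archive is an online platform for preserving and using digital records that have enduring value. However, the digital archives operated in Korea share no common principles for functionality, metadata, or data description. This makes it difficult to link digital records that exist in a distributed manner. As a way to improve the interoperability of digital records, this study proposes a common vocabulary for digital archives and evaluates the interoperability of a digital archive built with that vocabulary. It collects and analyzes data from the archive on the Korean financial crisis of 1997 to build a knowledge graph and compares its interoperability with that of a knowledge graph built with RiC-O. An evaluation framework based on the FAIR data principles is used to evaluate the archive and the knowledge graph. In the constructed knowledge graph, the various entities of the records are linked to one another, and contextual information that helps in understanding the records is provided. The validation results show that the knowledge graph built with the common vocabulary improves on the existing archive in the linking, searching, and interoperability of digital records.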

Abstract

A digital archive is an online platform for preserving and utilizing digital records worthy of continued preservation. However, digital archives in Korea share no common standards for functionality, metadata, or data description principles. This makes it difficult to link distributed digital records. This study proposes a common vocabulary for digital archives to enhance the interoperability of digital records and evaluates the interoperability of a digital archive built with the common vocabulary. We collect and analyze data from the digital archive on the Korean financial crisis of 1997 to construct a knowledge graph and compare its interoperability with that of a knowledge graph built with RiC-O. The archive and the knowledge graph are evaluated using an evaluation framework based on the FAIR data principles. The constructed knowledge graph links the various entities in the archive and provides contextual information that aids in understanding the records. The results show that a knowledge graph built with a common vocabulary improves the linkage, search, and interoperability of digital records compared to the existing archive.

3

FAIR Principles: Considerations for Implementing Digital Archives from a Data Perspective

김학래 (Chung-Ang University) 2021, Vol.21, No.2, pp.155-172 https://doi.org/10.14404/JKSARM.2021.21.2.155

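Abstract (translated from the Korean)

A digital archive is an electronic repository for preserving digital resources and using them on an ongoing basis. Theoretical research on digital archives is active, and archives for recording digital resources from various domains have been built and are in service. However, although the resources in digital archives satisfy the original goal of digitization, their search and reuse remain limited in practice. This study examines the FAIR data principles in detail and proposes a maturity assessment framework for applying them to digital archives. The FAIR data principles are a set of guidelines for making digital resources readable and processable by machines, and they can be applied to any resource on the web. The evaluation model for the FAIR data principles defines the planning and application stages separately. However, the criteria for evaluating whether individual principles have been applied are ambiguous, and evaluation criteria for the field of digital archives have not been sufficiently discussed. This study proposes a framework for applying the FAIR data principles to digital archives and discusses issues for future application.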

Abstract

Digital archives are electronic repositories used to preserve digital resources and utilize them sustainably. Theoretical research on digital archives is being conducted actively, and digital archives for recording resources from various domains are being built and put into service. However, although the resources in digital archives fulfill the original purpose of digitization, their discovery and reuse are still limited. This study examines the Findable, Accessible, Interoperable, and Reusable (FAIR) data principles in detail and proposes a maturity assessment framework for digital archives. The FAIR data principles are a set of guidelines for making digital resources readable and processable by machines, and they can be applied to any resource on the web. The evaluation model of the FAIR data principles defines the planning and application stages separately. However, the criteria for evaluating the application of individual principles are still ambiguous, and discussions on evaluation criteria for the field of digital archives are insufficient. This study proposes a framework for applying the FAIR data principles to digital archives and discusses issues for future application.
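As a rough illustration of what a maturity assessment over the four FAIR principles could look like, the sketch below scores a resource against a small checklist. The indicator names and the scoring rule (share of indicators satisfied per principle) are assumptions made for this example; the abstract does not specify the framework's actual criteria.

```python
# Hypothetical indicators grouped by FAIR principle. The paper's own
# criteria are not given in the abstract, so these names are assumptions.
CHECKLIST = {
    "Findable": ["has_persistent_identifier", "has_descriptive_metadata"],
    "Accessible": ["retrievable_over_http"],
    "Interoperable": ["uses_shared_vocabulary"],
    "Reusable": ["has_license"],
}


def fair_scores(resource):
    """Return, per principle, the fraction of indicators the resource satisfies."""
    scores = {}
    for principle, indicators in CHECKLIST.items():
        # A missing indicator is treated as not satisfied.
        passed = sum(1 for name in indicators if resource.get(name, False))
        scores[principle] = passed / len(indicators)
    return scores


# Hypothetical self-assessment of one archive resource.
example = {
    "has_persistent_identifier": True,
    "has_descriptive_metadata": False,
    "retrievable_over_http": True,
    "has_license": False,
}
scores = fair_scores(example)
```

A per-principle fraction keeps the result easy to compare across archives, though a real framework would likely weight indicators and distinguish the planning stage from the application stage.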

4

The 1997 Financial Crisis Knowledge Graph: A Relationship-Centered Approach to Digital Archives

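이유경 (Records Management major, Department of Library and Information Science, Graduate School, Chung-Ang University) ; 김학래 (Chung-Ang University) 2020, Vol.20, No.4, pp.1-17 https://doi.org/10.14404/JKSARM.2020.20.4.001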

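Abstract (translated from the Korean)

With advances in information technology, the digitization of archives is accelerating. However, conventional digital archives are limited in how effectively records can be searched, linked, and understood. This paper proposes a relationship-centered knowledge graph approach as a way to maximize the usability of digital archives. It reviews the characteristics of the “1997 외환위기 아카이브” (the archive on the Korean financial crisis of 1997), a case of a digital archive, and builds all entities in the archive and the relationships between them into a knowledge graph based on RiC-O (Records in Contexts-Ontology). The resulting financial crisis knowledge graph represents every entity of the archive in a machine-processable format. Compared with the digital archive, the knowledge graph approach allows entity information and the relationships between entities to be explored precisely, and it can therefore be used for semantic search and intelligent services.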

Abstract

Along with the development of information technology, the digitization of archives has been accelerating. However, conventional digital archives have limitations in effectively searching, interlinking, and understanding records. In response to these issues, this study proposes a knowledge graph that represents comprehensive relationships among heterogeneous entities in digital archives. The knowledge graph organizes the resources in the archive on the Korean financial crisis of 1997 by transforming them into named entities that can be discovered by machines. In particular, the study investigates and summarizes the characteristics of this archive as a digital archive. All resources in the archive are described as entities that have relationships with other entities using semantic vocabularies, such as Records in Contexts-Ontology (RiC-O). Moreover, the knowledge graph of the Korean financial crisis of 1997 is represented in the Resource Description Framework (RDF), a machine-readable format. Compared to conventional digital archives, the knowledge graph enables users to retrieve a specific entity with its semantic information and discover its relationships with other entities. As a result, the knowledge graph can be used for semantic search and various intelligent services.
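The relationship-centered idea can be sketched with a handful of subject-predicate-object triples and a traversal that discovers entities connected to a given record. The identifiers and predicates below use a made-up `ex:` prefix and are placeholders, not actual RiC-O terms or data from the archive; a real implementation would use an RDF library and the ontology's own properties.

```python
from collections import deque

# Placeholder triples. The "ex:" names are hypothetical and are not
# RiC-O properties or entities taken from the archive.
TRIPLES = [
    ("ex:record1", "ex:createdBy", "ex:agencyA"),
    ("ex:record1", "ex:relatedToEvent", "ex:event1997"),
    ("ex:record2", "ex:relatedToEvent", "ex:event1997"),
]


def neighbors(entity):
    """Yield entities one hop from `entity`, following triples in either direction."""
    for subject, _, obj in TRIPLES:
        if subject == entity:
            yield obj
        elif obj == entity:
            yield subject


def reachable(start, max_hops):
    """Breadth-first search for entities within `max_hops` relationships of `start`."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        entity, hops = queue.popleft()
        if hops == max_hops:
            continue  # do not expand beyond the hop limit
        for nxt in neighbors(entity):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, hops + 1))
    seen.discard(start)
    return seen


related = reachable("ex:record1", 2)
```

With a two-hop limit, the traversal from `ex:record1` reaches `ex:record2` through the shared event, a connection that a record-by-record listing would not surface.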

5

“COVID-19: Our Memory”: A Digital Archive on Coronavirus Disease and Social Change

김학래 (Chung-Ang University) 2020, Vol.20, No.4, pp.229-236 https://doi.org/10.14404/JKSARM.2020.20.4.229

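Abstract (translated from the Korean)

Coronavirus disease has brought a shock that human society had not experienced before, along with rapid changes in ways of life. The contact-free society is one example that became commonplace in the course of preventing the spread of infection. The social impact of coronavirus disease is wide-ranging. Various issues, such as government policy, personal information protection, and information technology, are affecting society as a whole. At the same time, because related events and issues change quickly, it is difficult to track and record factual information. How can COVID-19 and real-time information be described effectively? The “COVID-19: Our Memory” project is an attempt to record the sociocultural impact of coronavirus disease in a value-neutral way. It collects major events and issues by field, records key events from a neutral perspective, and builds a digital archive so that all records can be explored. All of the data collected and built through the project, the source code, and the applications, including visualizations, are made public to encourage new collaboration.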

Abstract

SARS-CoV-2 has had a significant impact on human society, bringing rapid changes in lifestyle that it had not experienced before. One example is the emergence of the zero-contact society, which became widespread as a way of preventing the spread of infectious disease. The social impact of COVID-19 is wide-ranging. Various issues, such as government policy, personal information protection, and health care, are affecting society as a whole. At the same time, factual information is difficult to track and record because related events and issues change rapidly. A method of effectively describing COVID-19 and real-time information is therefore necessary. The “COVID-19: Our Memory” project is an attempt to record the sociocultural impact of the coronavirus infection. The project collects major events and issues by subject, records those events from a neutral point of view, and develops a digital archive so that all records are accessible. All the data collected and built through the project, along with the application, including its source code and visualizations, are published to create new opportunities for collaboration.
