ChatGPT와 검색 그리고 인간

1.
ChatGPT가 등장한 이후 이곳저곳에 영향을 주고 있습니다. 직접적으로 영향을 받는 분야중 하나가 검색엔진이라고 합니다. 이런 이유 때문에 뉴욕타임즈를 비롯한 여러 매체들이 구글의 동향을 전합니다.

생성 인공지능 모델의 위협을 가장 크게 느끼는 분야는 단연 검색엔진이다. 방대한 데이터를 바탕으로 학습한 덕에 뛰어난 검색 능력을 갖춘 챗지피티는 이용자가 질문을 입력하면 정제된 맞춤형 답변을 친근한 대화체로 내놓는다 . 반면 검색엔진을 이용해 정보를 찾으려면 키워드를 입력한 뒤 나오는 수많은 자료에 일일이 들어가 개중에 쓸모있는 것을 이용자가 직접 추려내야 한다 . 챗봇이 내놓는 정보가 얼마나 믿을 만한지는 논외로 하더라도 , 챗봇을 활용하면 정보 습득 방식이 훨씬 간편하고 매끄러워질 것이라는 점은 부인하기 어렵다 . 텍스트보다 영상과 이미지 에 익숙한 엠제트 (MZ) 세대가 궁금한 걸 검색할 때 구글이나 네이버 검색창 대신 유튜브나 틱톡 , 인스타그램 등 을 찾는 것처럼 , 앞으로 ‘챗봇 네이티브 ’ 세대가 등장한다면 전통적인 형태의 검색엔진은 설 자리가 사라질 수 있다 .
1초가 다르게 똑똑해지는 챗GPT…‘오케이 구글’ 시대 끝나나중에서

위 기사의 출처라고 할 수 있는 뉴욕타임즈의 A New Chat Bot Is a ‘Code Red’ for Google’s Search Business는 Code Red를 내린 구글 경영진의 고민을 다음과 같이 정리합니다. 뉴욕타임즈 기사는 유료이기 때문에 A new chabot is a ‘Code Red’ for Google’s Search business의 기사를 옮깁니다. 핵심은 수익모델입니다. ChatGPT와 같은 서비스로 수익모델을 만들 수 있는가라는 질문입니다.

Google has spent several years working on chatbots and, like other Big Tech companies, has aggressively pursued artificial intelligence technology. Google has already built a chatbot that could rival ChatGPT. In fact, the technology at the heart of OpenAI’s chatbot was developed by researchers at Google.Called LaMDA, or Language Model for Dialogue Applications, Google’s chatbot received enormous attention in the summer when a Google engineer, Blake Lemoine, claimed it was sentient. This was not true, but the technology showed how much chatbot technology had improved in recent months.

Google may be reluctant to deploy this new tech as a replacement for online search, however, because it is not suited to delivering digital ads, which accounted for more than 80% of the company’s revenue last year.
“No company is invincible; all are vulnerable,” said Margaret O’Mara, a professor at the University of Washington who specializes in the history of Silicon Valley. “For companies that have become extraordinarily successful doing one market-defining thing, it is hard to have a second act with something entirely different.”

ChatGPT를 잠깜 이용한 적이 있습니다. 논문 읽기(How to read paper)을 쓰면서 한번 사용해보았습니다. 영어로 정리한 결과물은 놀라왔습니다. 또다른 결과물도 놀라웠습니다. LinkedIn에 올란 글입니다. Gautier Marti가 ChatGPT를 이용하여 72쪽 분량의 보고서를 만들었습니다. 모든 결과물을 ChatGPT의 질문과 결과를 모아서 만들었습니다. 또한 관련한 글을 SSRN에 공개하였습니다.

From Data to Trade: A Machine Learning Approach to Quantitative Trading

Download (PDF, 14.19MB)

2.
결과물을 보고 어떤 생각이 드시나요? “놀랍다”는 반응이 대부분일 듯 합니다. 여기서 다시금 구글 검색으로 돌아가 보죠. 결과물을 보면 ChatGPT는 네이버 지식iN의 인공지능버전이라는 생각이 듭니다. 다만 지식iN은 집단지성이라는 과정을 통하여 결과에 접근하고 질문자가 취사선택을 합니다. 선택이라는 과정을 통하여 인간은 개입하고 정보를 받아드립니다. 반면 ChatGPT는 읽기, 이해 그리고 선택이라는 과정을 생략하고 이를 ‘기계학습’이라는 과정으로 대체합니다. 인터넷상에 수많은 데이타를 알 수 없는 기준으로 학습한 결과를 받습니다. 인간에게 주어진 선택은 Yes 아니면 No입니다. 물론 정답이 필요한 사람에게 가장 높은 수준의 정보를 제공해주는 느낌입니다.

구글은 수익모델을 고민합니다. 검색이라는 수요를 가진 이용자들의 요구 또한 하나일 수 없습니다. 어떤 이는 빠른 결과=정답을 원합니다. 어떤 이는 선택, 분석 및 정리라는 과정을 원합니다. Seach Engine과 ChatGPT은 일정 부분에서 역할이 겹칩니다. 겹치는 부분에선 ChatGPT가 뛰어난 경쟁력을 가집니다. 수익모델만 빼고.

기계학습을 이야기할 때 항상 언급하는 주제가 있습니다.

“Machine Learning Bias (AI Bias)”

검색엔진은 데이타의 편향을 인간을 통하여 보정합니다. 물론 인간도 편향성을 가지기 때문에 어쩔 수 없지만 그 또한 인간의 선택입니다. 반면 기계학습은 데이타가 문제입니다. 영어와 한글로 입력한 ChatGPT의 결과가 똑같지 않은 이유이기도 합니다. 여기에 PC(Political Correctness)와 관련이 될 수 있는 주제는 더 민감한 결과일 듯 합니다.

패러다임(paradigm)이라는 표현을 많이 사용합니다. 어떤 한 시대 사람들의 견해나 사고를 근본적으로 규정하고 있는 테두리로서의 인식의 체계, 또는 사물에 대한 이론적인 틀이나 체계를 의미하는 개념입니다. 이를 지식의 생산과 데이타의 많고 적음에 연결할 수 있습니다. 표준 우주론은 암흑물질과 암흑에너지의 존재를 전제합니다. 최근 연세대 이영욱 교수팀은 암흑에너지의 존재에 의문을 제기하는 논문을 발표하였습니다. 새로운 관측이 나오고 이를 새롭게 해석하면서 가설이 등장합니다. ChatGPT에 표준우주론에 대해 질문을 해보죠. 역시나 이런저런 토론과 쟁점을 담지 못합니다.

ΛCDM (Lambda Cold Dark Matter) is a model of the universe that describes its evolution and the formation of its structure. It is based on the assumption that the universe is composed of three main components:
Cold dark matter: a type of matter that does not interact with light and does not move at high speeds, but it does have mass and it can clump together under the influence of gravity.
Dark energy: a mysterious form of energy that is thought to be responsible for the acceleration of the expansion of the universe.
Baryonic matter: the “normal” matter that we are familiar with, made up of atoms and the particles that make up atoms (protons, neutrons, and electrons).
The ΛCDM model is supported by a number of observations, including the cosmic microwave background radiation, the large scale structure of the universe, and the observed abundances of light elements. It is the standard model of cosmology and is widely accepted by the scientific community.

시험문제를 풀 때 위와 같은 답변은 도움을 줍니다. 다만 질문하고 비판하고 새로운 것을 찾는 과정에서 ChatGPT의 역할은 제한적일 듯 합니다.

영원한 정답은 없습니다. 기계의 시대에 인간의 역할이 무엇일지 스스로 고민할 때입니다.

生死苦海 (생사고해)
生死苦兮 (생사고혜)
獨步世間如犀角(독보세간여서각)

무소의 뿔처럼 혼자가라, 다만 인공지능과 함께…(^^)

(*)Why ChatGPT is not a threat to Google Search는 LLM Based Search Engine의 문제점을 크게 두가지로 정리합니다.

첫째는 정확성(Addressing the truthfulness of ChatGPT’s output will be a major challenge)
둘째는 데이타 갱신(Another challenge ChatGPT and other LLMs face is updating their knowledge base)
셋째는 속도(LLMs also have an inference speed problem)
넷째는 수익모델(the biggest challenge of an LLM-based search engine is the business model)

이 글 공유하기:

Leave a Comment 응답 취소