One thing to know about data science: there's (1) the job posting and (2) the actual work. Often you'll be hired for one thing and end up doing entirely different work.
To improve your chops in this field: (a) learn the basics of NLP, and (b) build yourself a RAG pipeline using LlamaIndex or LangChain (or another framework, but build it).
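To see what those frameworks are actually doing for you, here's a toy sketch of the RAG loop in plain Python: chunk a corpus, retrieve the most relevant chunks, and assemble an augmented prompt. The term-overlap scoring is a crude stand-in for real embedding similarity, and the corpus is made up for illustration.

```python
import re

def tokens(text):
    # crude tokenizer; dropping short words acts as a cheap stopword filter
    return {w for w in re.findall(r"[a-z0-9]+", text.lower()) if len(w) > 3}

def retrieve(query, corpus, k=2):
    """Rank chunks by term overlap with the query -- a toy stand-in for
    vector similarity search over embeddings."""
    q = tokens(query)
    return sorted(corpus, key=lambda c: len(q & tokens(c)), reverse=True)[:k]

def build_prompt(query, corpus):
    """The 'augment' step: prepend retrieved context to the question
    before handing it to the LLM for generation."""
    context = "\n".join(retrieve(query, corpus))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"

corpus = [
    "Our refund policy allows returns within 30 days.",
    "The office is closed on public holidays.",
    "Support tickets are answered within 24 hours.",
]
print(build_prompt("What is the refund policy?", corpus))
```

Swap in an embedding model and a vector store and you have the skeleton of what LlamaIndex and LangChain package up for you.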
As for fine-tuning and deep learning: without experience in the field, it's tough. Like any complex skill (deep learning more so than fine-tuning), knowledge comes with time and exposure. So go find a reason to fine-tune a model, or build and train a neural network, and go friggin do it.
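If you want a concrete starting point for the "build and train a neural network" part, here's a minimal from-scratch network in numpy that learns XOR with one hidden layer. The architecture and hyperparameters (8 hidden units, learning rate 0.5, 5000 steps) are arbitrary choices for the exercise, not recommendations.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)  # XOR targets

W1 = rng.normal(size=(2, 8)); b1 = np.zeros(8)
W2 = rng.normal(size=(8, 1)); b2 = np.zeros(1)
sigmoid = lambda z: 1 / (1 + np.exp(-z))
lr = 0.5

for step in range(5000):
    # forward pass: tanh hidden layer, sigmoid output
    h = np.tanh(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # backward pass: gradients of squared error through both layers
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * (1 - h ** 2)
    # gradient descent update
    W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(0)
    W1 -= lr * X.T @ d_h;   b1 -= lr * d_h.sum(0)

mse = float(((out - y) ** 2).mean())
print("final MSE:", mse)
```

Writing the backward pass by hand once, before switching to PyTorch or JAX, is exactly the kind of exposure that makes the rest of deep learning less mysterious.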
Private data, especially in the enterprise, cannot be sent to public LLMs like GPT-4 (or 5, or N). Use cases requiring data privacy have to use an internally hosted LLM application. Currently, RAG is a concrete and pragmatic enterprise use of LLMs (aside from summarization), and it is not amenable to using GPT-4.
GPT-5 may very well be amazing. But unless it runs on-prem, it can't be used in many scenarios because of data privacy.
To the OP: learning how to run LLMs locally via, say, Ollama (see ollama.ai) will get you started in a hands-on manner. See the /r/LocalLLaMA subreddit for a very active community around running LLMs locally.
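Once `ollama serve` is running, you can talk to it over its local REST API (default port 11434) with nothing but the standard library. A minimal sketch, assuming a model such as "llama3" has already been fetched with `ollama pull` (substitute whatever model you have):

```python
import json
import urllib.request

def build_payload(prompt, model="llama3"):
    """Request body for Ollama's /api/generate endpoint.
    stream=False asks for one complete JSON response instead of chunks."""
    return {"model": model, "prompt": prompt, "stream": False}

def ollama_generate(prompt, model="llama3", host="http://localhost:11434"):
    """Send a prompt to a locally running Ollama server and return the
    generated text. Requires `ollama serve` to be up on the host."""
    data = json.dumps(build_payload(prompt, model)).encode()
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=data,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Call `ollama_generate("Summarize this contract clause: ...")` and the data never leaves your machine, which is the whole point for the privacy-constrained scenarios above.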
I think you're missing my point. RAG is something you currently implement yourself, or pay someone who already has. With GPT-5 (or GPT-6 at the latest), you just give it the same access you would give a RAG system and describe what you want it to do. It will do the rest.
Edit: I’m assuming the scenario where you do want to use the best model.