Uncategorized

OpenAI, LLM – Q*, AGI, Self Taught Reasoning, Optimizing, Synthetic data

December 3, 2023

777

Trying to catch up on all the news around what happened at OpenAI. There are a lot of rumours around Q*, AGI, Self Taught Reasoning and Optimizing, Synthetic data and related algorithms.

It all revolves around LLM’s reaching a limit due to lack of training data. Synthetic data can be a possible solution, something I have been talking about for a while. Good quality synthetic data can improve models. If the constraints, parameters, distribution can be defined by a combination of human and algorithmic input, we can create better quality training data for algorithms. Add to that automated self-learning models self-generating training data to improve further in a continuous loop of trial and error, you have the potential for creating AGI type systems in the future.

Will be interesting to see how much of this will be realized in the next 5-10 years and its potential impact on the global economy across all industries. Given the impact there will be lot of debate between doomers vs accelerationists. I am all in favour of accelerationism with sensible guards. Innovation and technological progress has rarely been stopped and eventually created disruptive improvements for society.

Everything I have learned in the last 5 years looks like basic stuff compared to what is being built now, algorithmically and technologically, at an ever rapid pace. All these topics require lot of theory and modelling before getting a clear idea what it is and how to use it for business solutions.

Giving some papers links below on what is being discussed. There is a lot more information available online about what else is being discussed. I think the next few years will be very exciting for AI.

SELF-TAUGHT OPTIMIZER (STOP):RECURSIVELY SELF-IMPROVING CODE GENERATION – https://arxiv.org/pdf/2310.02304.pdf

STaR: Self-Taught Reasoner Bootstrapping Reasoning With Reasoning – https://arxiv.org/abs/2203.14465

LARGE LANGUAGE MODELS CAN SELF-IMPROVE – https://arxiv.org/abs/2210.11610

Tree of Thoughts: Deliberate Problem Solving with Large Language Models – https://arxiv.org/pdf/2305.10601.pdf

Good article on synthetic data – https://www.interconnects.ai/p/llm-synthetic-data

RoboAdvisory Algorithm using Macroeconomic data

RandomForest Regression model for predicting US 10 year Treasury Bond Prices…

DataWisdomX – Data Science course – Introductory videos to all lectures

Data Science – End 2 End Beginners Course Part 1 –…

RoboAdvisory Algorithm using Macroeconomic data

RandomForest Regression model for predicting US 10 year Treasury Bond Prices…

DataWisdomX – Data Science course – Introductory videos to all lectures

Data Science – End 2 End Beginners Course Part 1 –…

RandomForest Regression model for predicting US 10 year Treasury Bond Prices…

DataWisdomX – Data Science course – Introductory videos to all lectures

Data Science – End 2 End Beginners Course Part 1 –…

KDnuggets – Top Data Science, Machine Learning Methods Used, 2018/2019

RandomForest Regression model for predicting US 10 year Treasury Bond Prices…

DataWisdomX – Data Science course – Introductory videos to all lectures

Data Science – End 2 End Beginners Course Part 1 –…

Youtube – MIT OpenCourseWare – Statistics lecture series

YouTube tutorials – Stanford NLP Lecture series

OpenAI, LLM – Q*, AGI, Self Taught Reasoning, Optimizing, Synthetic data

EDITOR PICKS

RoboAdvisory Algorithm using Macroeconomic data

RandomForest Regression model for predicting US 10 year Treasury Bond Prices...

DataWisdomX – Data Science course – Introductory videos to all lectures

POPULAR POSTS

Pandas for Data Wrangling – tutorial, cheat sheet

ML Map – Choosing the right algorithm for your problem

Geoffrey Hinton, Father of Deep Learning, research articles page

POPULAR CATEGORY

When not to use Deep Learning – LSTM (Long Short-term Memory)...

Comparing AWS, Azure, Google Cloud for AI/ML model training, MLOps, GPU...