| The thing is the OpenAI models still seem to be much more practical because they are significantly smarter than other models, especially the open source ones. You can use them without needing to train them. And creating the data for training can be a big effort. Based on the projects you mentioned, my suggestion would be: - to get signed up with OpenAI and make sure you have GPT-4 access set up with the API. - even if the GPT-4 access isn't available immediately, you can get pretty far with gpt-3.5-turbo. - Find a tutorial on using the OpenAI LLM API. - Get the tutorial working and then modify it for your first use case. The prompt can be something like: You will receive a financial transaction encoded as JSON. Output a classification as JSON in this format: { "transaction_type": "food"} where transaction_type is one of "food", "other", "transportation", etc. - For the analytics project, you will want to find an OpenAI LLM tutorial for using their new Functions feature. Give it functions like configureAndShowPage(filters) or maybe even queryAnalyticsData(sql). The AI would write those function calls and parameters out on the fly based on a user question, your program would receive those function calls and execute them and display the result to the user. - For the analytics documentation search, look up something like "OpenAI embedding search" or "llamaindex starter tutorial". As far as open source, I know some of the recent ones are showing more promise, but I still believe they will need significant training to really be useful for most tasks. But I would be really interested to hear if that is not the case, or how someone with a lot of experience with the open source models would approach your use cases. I assume actually the embedding search might work okay with one of the latest embedding models. Still more hassle and possibly more expensive to run than OpenAI. Is the idea to use open source only because you are worried your wife will get mad if you spend $20 on the OpenAI API or something? I mean I get it if you just prefer to use open source in general. I would like to also. Its just that until the coding and other abilities for open source models get better, it seems much more practical to skip all the training and hosting and just use the OpenAI API with their general-purpose and capable models. |