Skip to content

šŸŽ‰ One Month in Singapore's AI Scene šŸ¤–

Published:

Itā€™s been an action-packed first month in Singapore for me, settling into the new country and immersing myself in the vibrant AI community here! Hereā€™s an overview of what Iā€™ve been up to and my main takeaways from 24 hours of meetups and conferences on Generative AI, Machine Learning for finance, and Data Engineering.

These events were on the following topics:

Data Tuesday #1

Members at the first ever Data Tuesday Singapore. Photo by Ville

27th July - šŸ¤– Autonomous Agents with Large Language Models (LLMs) at Google Developers Space

At Machine Learning Singaporeā€™s event, Sam Witteveen from RedDragon.aiā€™s presented:

Samā€™s YouTube channel is a goldmine for anyone keen on the latest GenAI developments.

Martin Andrews, fellow co-founder, shared:

Check out Martinā€™s insightful presentation at https://redcatlabs.com/2023-07-27_MLSG_OpenEnded/#/openended-talk.

Sam presenting Autonomous Agents with LLM

Sam presenting Autonomous Agents with LLM

28th July - šŸ„ Generative AI: Improving Patient Care with AI and LLMs

H2O.ai hosted a session on Real-life use-cases of Generative AI: Improving Patient Care with AI and LLMs. Jong Hang demonstrated 4 practical examples in the hospital and military sectors and how you can use chain-of-thought prompting for instruction design to combat hallucinations and ensure the GPT only answers questions within its domain / task.

Vishal Sharma showcased two open-source H2O.ai tools:

  1. h2oGPT - harness LLMā€™s fine-tuned by the worldā€™s best Kaggle Grandmasters. Have a play online at https://gpt.h2o.ai/ or clone and run locally at https://github.com/h2oai/h2ogpt.
  2. LLM Studio for fine-tuning your LLMs in a no-code GUI with your own data, with LoRA, QLoRA, and even Reinforced Learning from Human Feedback (RLHF). Available to clone at https://github.com/h2oai/h2o-llmstudio.

1st August - šŸ’½ Data Tuesday #1

Kicked off the first-ever Data Tuesday by Ville Kulmala. Itā€™s a relaxed space for genuine data discussions, youā€™ll know where to find me every month!

2nd & 3rd August - šŸŒ World AI Show at the Marina Bay Sands Exhibition Center

Itā€™s hard to summarise the 14 talks I attended across the two day AI conference (this could easily have its own blog post). The standout session for me was the panel discussion on AI and Data Driven Decision Making between Robert Hollinger, Adrien Chenailler, Jeannette Pang, and Ram Thilak. They explained how they are making an impact for their customers in the banking and automotive sectors with AI and ML through:

Underpinning all this, I was glad to hear how much emphasis they put on data ethics, through:

Full list of topics covered
  1. Synergey of Digital Transformation and AI: Powering Organisational Growth
  2. Embarking on a Journey to Democratise AI at Scale
  3. Make Data Science a Team Sport
  4. The Emergence of AI
  5. Revolutionising Customer Experience with Conversational AI
  6. Automation in Data Management: Enhancing Efficiency & Saving Time
  7. AI and Data Driven Decision Making Panel Discussion
  8. Generative AI: A Game Changer?
  9. The Why, Where and How of Enterprise AI Adoption
  10. Sustainable AI for Humanity
  11. Blockchain for Healthcare
  12. Emergence of Web3 and Gaming and Virtual Worlds
  13. Building Trustworthy and Ethnical AI Panel Discussion
  14. Securing the Future of AI: Addressing Privacy, Security, and Compliance in LLMs

Intro to World AI Show

Opening event to World AI Show

2nd August - šŸ› ļø dbt meetup

I learnt about SQL-centric data engineering at a dbt SG meetup. Two Data Engineers from Teleport showcased how dbt revolutionised their workflow, with the following examples:

Iā€™ll definitely be experimenting with this open-source tool and seeing how it can applied to my future workflows.

3rd August - šŸ§  How to break into AI Careers

Thu Ya Kyaw (SideQuest founder), Koo Ping Shung (co-founder of DataScienceSG) and Poh Wan Ting and Michelle Lim (Mastercard Lead/Senior Data Scientists) shared their unique journeys on How to break into AI careers. My takeaways were:

16 August - šŸ‘·ā€ā™‚ļø OCBC Data Engineering Revolution

Periyasamy Sivakumar gave a great deep dive into OCBCā€™s modern Data Engineering tech stack, at Data Engineering SG meetup. This tech stack consists of:

It was great to hear the full story on:

OCBC Data Engineering

How OCBC utilise data

18 August - šŸ¦… Launch of BrightRaven.ai

Celebrated Singaporeā€™s newest AI startup BrightRaven.ai fantastic launch party at the Mondrian Duxton. It was great mingling with Bertrand Lee (founder) and co and hearing their wealth of experience, whilst enjoying the rooftop view! Best wishes to the team on their AI consultancy journey!

21 August - šŸ’µ Application of ML and GenAI in Finance with Fullerton Fund Management

Kai Xin (co-founder of DataScienceSG) wonderfully explained the LLM stack using a burger analogy. Where the bun is analogous to the static components, such as prompt engineering and Retrieval-Augmented Generation (RAG), and the burger patty representing the re-useable components that can be swapped for different ā€˜flavoursā€™ of fine-tuned LLM models. This is made possible through the use of adapters which can identify and switch to the correct fine-tuned model based on the user input. Kai Xinā€™s slides can be found at bit.ly/practical-genai-ft.

Kai Xin also showed easy it is do to Practical Generative AI Fine-Tuning. In his live-demo he fine-tuned a Flan-T5 model for financial sentiment analysis in just 8 minutes for free in a Colab notebook, using HuggingFaceā€™s implementation of Quantised LoRA (QLoRA).

Chao Jen shared how they use clustering models to model the Global Macro Regime in combination with a second model for regime transitions - to help their portfolio managers with asset relocation by considering the current global situation. The model outputs were very clearly and intuitively visualised by stacked bar charts showing the probability distributions for each regime. It was impressive to hear how theyā€™ve trained 1000s of models, but there is still a strong need for domain knowledge and judgement between the data scientists and portfolio managers.

Shi Hui shared the importance of using time-based cross-validation for predicting winners in the stock market - not to use Scikit-Learnā€™s method which can introduce bias and has a limitation of assuming one observation per day. For this type of analysis, ideally youā€™d have at least two cycles of the economy, so at least 5-7 years. Some useful predictive external signals include estimates data for company fundamentals, news sentiment, and investor company visits.

Yan Rong shared how to predict the U.S. Treasuries Yield Curve with PCA Decomposition for global market simulation. YTC is the rate that U.S. banks can borrow money. Yan Rong showed how you can reduce from 11 features to 3 components and still capture 99% of the variance. You can isolate these components in line charts to understand the intuition in the YTC movements and do scenario simulations. The code and slides by the Fullerton data scientists can be found at https://github.com/shihuiFFMC/dssg_aug2023.

Kai Xin at DataScience Singapore

Kai Xin summarising the LLM landscape

29 August - āš›ļø Data Tuesday #2 - Quantum Computing

Caught up again with some familiar faces and also new faces at The Terrace again, this time discussing Quantum Computing. Some thoughts that I left with:

31 August - šŸ” Why Vector Search is Important

Yoshi Kimoto from Datastax explained Why Vector Search is Important for LLMs. Yoshi explained how:

You can test Datastaxā€™s vector search demo in your browser here.

Yoshi presenting Why Vector Search is Important

Yoshi on the LLM stack

Wrapping up

Itā€™s been a whirlwind month, and itā€™s been energising to learn and mingle with the brightest AI minds in Singapore. Iā€™m looking forward to applying these insights and sharing more with you all. After all, education without action is simply entertainment!

Lastly, a big thanks to my talented ex-colleagues in the UK civil service at the Department of Health and Social Care for having me in your teams Lucy Vickers, Phil Walmsley, Mariana, Graeme, and Anita Brock.

Thanks for reading and making it all the way down here! If you fancy a chat about any of these topics, drop me an email.

Cheers!

Vince


Previous Post
šŸ” Unmasking the Unusual: An Intro to Anomaly Detection
Next Post
Hello and Welcome! šŸ‘‹
-->