Image by Author
For a while now, ChatGPT has been in the limelight. Everyone is talking about it, and a lot of people are using it, what could possibly go wrong?
Google has always aimed to maintain its reputation of being an AI-first company, and so far they have been doing well. However, in…
A team of researchers from the University of Wisconsin-Madison, NVIDIA, the University of Michigan, and Stanford University have developed a new vision-language model (VLM) called Dolphins. It is a conversational driving assistant that can process multimodal inputs to provide informed driving instructions. Dolphins are designed to address the complex driving scenarios faced by autonomous vehicles…
Notes [1] Abramson, J., Ahuja, A., Barr, I., Brussee, A., Carnevale, F., Cassin, M., Chhaparia, R., Clark, S., Damoc, B., Dudzik, A. and Georgiev, P., 2020. Imitating interactive intelligence. arXiv preprint arXiv:2012.05672. [2] Abramson, J., Ahuja, A., Brussee, A., Carnevale, F., Cassin, M., Fischer, F., Georgiev, P., Goldin, A., Harley, T. and Hill, F., 2021.…
Three stories about the data career journey “The number 12 is considered a cosmic number — marking the 12 months, the 12 signs of the zodiac, and the 12 stations of the Moon and the Sun” [Photo by Michael Skok on Unsplash]In my previous position, I created end-of-year reports for my business colleagues and CEOs.…
Image by Author
Data science is a vast field, combining elements of statistics, machine learning, and data analysis. To navigate this complex domain, having a set of handy cheat sheets can be immensely helpful.
The cheat sheets can also serve as a valuable resource for preparing for technical interviews, reviewing key concepts, and…
How can the effectiveness of vision transformers be leveraged in diffusion-based generative learning? This paper from NVIDIA introduces a novel model called Diffusion Vision Transformers (DiffiT), which combines a hybrid hierarchical architecture with a U-shaped encoder and decoder. This approach has pushed the state of the art in generative models and offers a solution to…
Research
Published
…
A deep exploration of TiDE, its implementation using Darts and a real life use case comparison with DeepAR (a Transformer architecture) As industries continue to evolve, the importance of an accurate forecasting becomes a non-negotiable asset wether you work in e-commerce, healthcare, retail or even in agriculture. The importance of being able to foresee what…
Image from OpenAI GPT's main view.
In our rapidly evolving digital world, artificial intelligence (AI) is not just a buzzword but a revolutionary force reshaping how we interact with technology.
Ever since ChatGPT was first launched, there’s not been a single week without a big leap forward in the AI field.
Just…
How can Neural Radiance Fields (NeRFs) be improved to handle scale variations and reduce aliasing artifacts in scene reconstruction? A new research paper from CMU and Meta addresses this issue by proposing PyNeRF (Pyramidal Neural Radiance Fields). It improves neural radiation fields (NeRFs) by training model heads at different spatial grid resolutions, which helps…