There has been a recent uptick in the development of general-purpose multimodal AI assistants capable of following visual and written directions, thanks to the remarkable success of Large Language Models (LLMs). By utilizing the impressive reasoning capabilities of LLMs and information found in huge alignment corpus (such as image-text pairs), they demonstrate the immense potential…
In this short piece, I use public Wikipedia data, Python programming, and network analysis to extract and draw up a network of Oscar-winning actors and actresses. All images were created by the author. Wikipedia, as the largest free, crowdsourced online encyclopedia, serves as a tremendously rich data source on various public domains. Many of these…
The intersection of artificial intelligence and creativity has witnessed an exceptional breakthrough in the form of text-to-image (T2I) diffusion models. These models, which convert textual descriptions into visually compelling images, have broadened the horizons of digital art, content creation, and more. Yet this rapidly evolving area of Personalized T2I generation study grapples with several core…
Some of the struggles I face frequently as a data scientist Photo by ThisIsEngineering from Pexels: https://www.pexels.com/photo/female-software-engineer-coding-on-computer-3861972/Ostensibly, it may seem that being a data scientist is all sunshine and rainbows (at least I think that is the perception I give from my posts!). High pay, great benefits, flexible hours, and interesting work are some things…
Deep learning has revolutionized view synthesis in computer vision, offering diverse approaches like NeRF and end-to-end style architectures. Traditionally, 3D modeling methods like voxels, point clouds, or meshes were employed. NeRF-based techniques implicitly represent 3D scenes using MLPs. Recent advancements focus on image-to-image approaches, generating novel views from collections of scene images. These methods often…
An end-to-end implementation of a Pytorch Transformer, in which we will cover key concepts such as self-attention, encoders, decoders, and much more. 17 min read · 18 hours ago Photo by Susan Holt Simpson on UnsplashWhen I decided to dig deeper into Transformer architectures, I often felt frustrated when reading or watching…
Careers in data are not for everyone — you need patience to work with evolving business, security, and infrastructure requirements and a good amount of mental endurance to work with endless data issues and changes. But they can also be the most interesting jobs in the world. Every month has a new puzzle to put…
MoD-SLAM is a state-of-the-art method for Simultaneous Localization And Mapping (SLAM) systems. In SLAM systems, it is challenging to achieve real-time, accurate, and scalable dense mapping. To address these challenges, researchers have introduced a novel method focusing on unbounded scenes using only RGB images. Existing neural SLAM methods often rely on RGB-D input which leads…
Choosing the model that works best for your data We’ll use the EU AI act as the data corpus for our embedding model comparison. Image by Dall-E 3.OpenAI recently released their new generation of embedding models, called embedding v3, which they describe as their most performant embedding models, with higher multilingual performances. The models come…
Data privacy has grown to be a major concern for both individuals and companies in the current digital environment.The amount of personal data being collected and processed is increasing. Therefore, it is essential to establish reliable systems that protect individual privacy while also providing valuable data and analysis.
We will discuss the importance of…