Retrieval Augmented Generation (RAG) is a well-known approach to creating generative AI applications. RAG combines large language models (LLMs) with external world knowledge retrieval and is increasingly popular for adding accuracy and personalization to AI. It retrieves relevant information from external sources, augments the input with this data, and generates responses based on both. This […]
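The retrieve, augment, generate loop described in the excerpt above can be sketched in a few lines of Python. This is a toy illustration only, not the article's implementation: the document list, the word-overlap scoring, and the `generate` stub (which stands in for a real LLM call) are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical in-memory "external source"; a real system would query a
# search index or vector store instead.
DOCUMENTS = [
    "Rollover indices keep shard sizes bounded for time-series data.",
    "RAG pairs a language model with retrieval from external sources.",
    "EDI lets payers and providers exchange claims electronically.",
]


def tokenize(text):
    """Lowercase the text and strip basic punctuation from each word."""
    return [word.strip(".,?!").lower() for word in text.split()]


def retrieve(query, documents, k=1):
    """Return the k documents sharing the most words with the query."""
    query_counts = Counter(tokenize(query))

    def score(doc):
        # Size of the multiset intersection between query and document words.
        return sum((query_counts & Counter(tokenize(doc))).values())

    return sorted(documents, key=score, reverse=True)[:k]


def augment(query, passages):
    """Build a prompt that places the retrieved passages before the question."""
    context = "\n".join(f"- {p}" for p in passages)
    return f"Context:\n{context}\n\nQuestion: {query}"


def generate(prompt):
    """Placeholder for an LLM call (assumption: swap in a real model client)."""
    return f"[model response to a {len(prompt)}-character prompt]"


def rag_answer(query, documents=DOCUMENTS):
    """Run the three steps in order: retrieve, augment, generate."""
    passages = retrieve(query, documents)
    prompt = augment(query, passages)
    return generate(prompt)
```

Calling `rag_answer("What does RAG pair a language model with?")` retrieves the RAG sentence from the toy corpus, wraps it in a prompt, and passes that prompt to the stub generator.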
From Challenges to Opportunities: The AI-Data Revolution
By Kamal Hathi, SVP and GM, Splunk Products & Technology. Today’s fast-evolving digital landscape, especially with the explosive growth of AI, has rapidly added to the complexity of data management. This growing dependence on AI has not only added to complexity, but also transformed strategic data management from a competitive advantage into a business imperative. Data […]
The best laptops of 2025: I’ve tested dozens of laptops and these are the best ones
Laptops come in a variety of different form factors these days, with manufacturers playing into the different categories to develop an intended use case. For example, lightweight laptops are made to be carried around, and trade in some raw power for portability. 2-in-1 laptops come with touchscreens that allow for use as a tablet. The […]
Amplifying Women’s Voices in Data Leadership workshop co-hosted by Women in Big Data Berlin and Thoughtworks
A big thank you to everyone who joined us for the “Amplifying Women’s Voices in Data Leadership” workshop co-hosted by Women in Big Data Berlin and Thoughtworks. Together with participants we explored what it truly means to lead with […]
How Data Analytics Reduces Truck Accidents and Speeds Up Claims
One thing that we talk a lot about at Smart Data Collective is how data can be used to reduce risks on the road. It is hard to ignore the growing role of data in accident prevention, especially in the trucking industry. There are more than 43,000 motor vehicle deaths reported by the Department of […]
Building an AI-First Interface for Precisely APIs with Model Context Protocol
Over the past few weeks, I’ve been exploring ways to streamline access to Precisely’s APIs using AI-first tooling. One promising approach has been to leverage the Model Context Protocol (MCP)—an open standard developed by Anthropic—to connect APIs with modern large language model (LLM) interfaces, such as Claude Desktop. Today, I’d like to share a lightweight […]
Big Data Career Notes for June 2025
It’s that time of month again–time for Big Data Career Notes, a monthly feature where we keep you up-to-date on the latest career developments for individuals in the big data community. Whether it’s a promotion, new company hire, or even an accolade, we’ve got the details. Check in each month for an updated list […]
Simplifying Healthcare Data and Claims Management: Introducing Databricks X12 EDI Ember
EDI and its role in the Healthcare Ecosystem: Electronic Data Interchange (EDI) is a semi-structured data exchange method allowing healthcare organizations like Payers, Providers, etc., to seamlessly share vital transactional information electronically. Its standardized approach ensures accuracy and consistency across healthcare operations. EDI transactions used for various healthcare operations include: Claims submissions, Remittance, and Benefit […]
Amazon OpenSearch Service and Firehose: Rolling over indices automatically
Optimizing OpenSearch clusters with time-series data means optimizing the shard size and count. When ingesting data using AWS Data Firehose, this can be achieved using Rollover Indices. Read on for a guide on how to set this up. When deciding how to organize data into indices in OpenSearch, the first consideration is what kind of […]
Benefits of Using a Development Platform in the Cloud for Businesses
Cloud-based development platforms have transformed the way businesses create and deploy software applications, offering unparalleled advantages in today’s fast-paced digital landscape. Companies across sectors are shifting away from traditional local server setups to embrace the efficiency and flexibility of cloud environments. Accelerated development cycles: Modern businesses face intense pressure to deliver software solutions quickly while […]