Skip to content
Navigation menu
Search
Powered by
Search
Algolia
Search
Log in
Create account
DEV Community
Close
#
bigdata
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Using DolphinScheduler API to Achieve Efficient Batch Workflow Import and Script Deployment
Chen Debra
Chen Debra
Chen Debra
Follow
Jan 22
Using DolphinScheduler API to Achieve Efficient Batch Workflow Import and Script Deployment
#
api
#
programming
#
tooling
#
bigdata
5
reactions
Comments
Add Comment
3 min read
Essential Skills Every Aspiring Data Scientist Should Acquire for Career Success (2025)
Argha Sarkar
Argha Sarkar
Argha Sarkar
Follow
Jan 21
Essential Skills Every Aspiring Data Scientist Should Acquire for Career Success (2025)
#
datascience
#
bigdata
#
cloud
#
webdev
Comments
Add Comment
3 min read
Run PySpark Local Python Windows Notebook
chuongmep
chuongmep
chuongmep
Follow
Jan 21
Run PySpark Local Python Windows Notebook
#
bigdata
#
python
#
spark
#
dataengineering
Comments
Add Comment
3 min read
Data formats - how and when
Ashok Nagaraj
Ashok Nagaraj
Ashok Nagaraj
Follow
Jan 17
Data formats - how and when
#
csv
#
json
#
parque
#
bigdata
Comments
Add Comment
3 min read
Top 10 Tools for Efficient Web Scraping in 2025
WISDOMUDO
WISDOMUDO
WISDOMUDO
Follow
Jan 16
Top 10 Tools for Efficient Web Scraping in 2025
#
webscraping
#
datascience
#
automation
#
bigdata
2
reactions
Comments
Add Comment
4 min read
Vector search using Alibaba Cloud inference API and semantic text
A_Lucas
A_Lucas
A_Lucas
Follow
Jan 20
Vector search using Alibaba Cloud inference API and semantic text
#
bigdata
#
elasticsearch
#
tutorial
#
api
Comments
Add Comment
10 min read
When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability
Alex Merced
Alex Merced
Alex Merced
Follow
Jan 7
When to use Apache Xtable or Delta Lake Uniform for Data Lakehouse Interoperability
#
dataengineering
#
dataanalytics
#
datascience
#
bigdata
Comments
Add Comment
5 min read
Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform
Matan Shidlov
Matan Shidlov
Matan Shidlov
Follow
Jan 7
Using Apache Parquet to Optimize Data Handling in a Real-Time Ad Exchange Platform
#
bigdata
#
dataengineering
#
datascience
#
machinelearning
2
reactions
Comments
Add Comment
3 min read
Compression algorithms in Parquet Java
Jerónimo López
Jerónimo López
Jerónimo López
Follow
Jan 20
Compression algorithms in Parquet Java
#
parquet
#
java
#
compression
#
bigdata
5
reactions
Comments
2
comments
7 min read
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀
Madhav
Madhav
Madhav
Follow
Jan 6
The Columnar Approach: A Deep Dive into Efficient Data Storage for Analytics 🚀
#
database
#
bigdata
#
dataengineering
#
analytics
Comments
Add Comment
4 min read
Rethinking distributed systems: Composability, scalability
Juan José de las Heras
Juan José de las Heras
Juan José de las Heras
Follow
Jan 14
Rethinking distributed systems: Composability, scalability
#
composablearchitecture
#
distributedsystems
#
ai
#
bigdata
Comments
Add Comment
5 min read
Goodbye Kafka: Build a Low-Cost User Analysis System
ksanaka
ksanaka
ksanaka
Follow
Dec 5 '24
Goodbye Kafka: Build a Low-Cost User Analysis System
#
database
#
kafka
#
bigdata
Comments
Add Comment
5 min read
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
Tim Huang
Tim Huang
Tim Huang
Follow
Dec 18 '24
Query 1B Rows in PostgreSQL >25x Faster with Squirrels!
#
postgres
#
dataengineering
#
analytics
#
bigdata
Comments
8
comments
5 min read
Introduction to Hadoop:)
Madhav Ganesan
Madhav Ganesan
Madhav Ganesan
Follow
Nov 24 '24
Introduction to Hadoop:)
#
hadoop
#
bigdata
#
nlp
#
llm
6
reactions
Comments
Add Comment
10 min read
Big Data Trends That Will Impact Your Business In 2025
TechDogs
TechDogs
TechDogs
Follow
for
TechDogs
Dec 24 '24
Big Data Trends That Will Impact Your Business In 2025
#
bigdata
#
trends
#
2025
#
technology
5
reactions
Comments
Add Comment
6 min read
The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework
Chen Debra
Chen Debra
Chen Debra
Follow
Nov 20 '24
The Heart of DolphinScheduler: In-Depth Analysis of the Quartz Scheduling Framework
#
apachedolphinscheduler
#
quartz
#
opensource
#
bigdata
8
reactions
Comments
Add Comment
3 min read
SQL Filtering and Sorting with Real-life Examples
Millie Molotov
Millie Molotov
Millie Molotov
Follow
Dec 23 '24
SQL Filtering and Sorting with Real-life Examples
#
database
#
sql
#
mysql
#
bigdata
Comments
Add Comment
4 min read
Big Data
williamxlr
williamxlr
williamxlr
Follow
Nov 13 '24
Big Data
#
bigdata
#
hadoop
#
spark
Comments
Add Comment
1 min read
Introduction to Data lakes: The future of big data storage
Hiswill Thompson
Hiswill Thompson
Hiswill Thompson
Follow
Dec 14 '24
Introduction to Data lakes: The future of big data storage
#
bigdata
#
dataengineering
5
reactions
Comments
Add Comment
2 min read
Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi
Javier Andre Neira Machaca
Javier Andre Neira Machaca
Javier Andre Neira Machaca
Follow
Dec 14 '24
Construyendo una aplicación con Change Data Capture (CDC) utilizando Debezium, Kafka y NiFi
#
cdc
#
bigdata
1
reaction
Comments
Add Comment
3 min read
5 effektive Methoden, um Bilder aus Webseiten zu extrahieren
hanna Fischer
hanna Fischer
hanna Fischer
Follow
Dec 12 '24
5 effektive Methoden, um Bilder aus Webseiten zu extrahieren
#
webscraping
#
bigdata
#
bilder
#
firefox
1
reaction
Comments
Add Comment
3 min read
The Apache Iceberg™ Small File Problem
Danica Fine
Danica Fine
Danica Fine
Follow
Dec 11 '24
The Apache Iceberg™ Small File Problem
#
bigdata
#
apacheiceberg
#
datalakehouse
#
dataengineering
5
reactions
Comments
Add Comment
3 min read
System Design 09 - Data Partitioning: Dividing to Conquer Big Data
Sarva Bharan
Sarva Bharan
Sarva Bharan
Follow
Nov 12 '24
System Design 09 - Data Partitioning: Dividing to Conquer Big Data
#
systemdesign
#
bigdata
#
datapartition
Comments
Add Comment
2 min read
Understanding Star Schema vs. Snowflake Schema
Puneet Verma
Puneet Verma
Puneet Verma
Follow
Nov 16 '24
Understanding Star Schema vs. Snowflake Schema
#
dataengineering
#
datascience
#
datamodeling
#
bigdata
Comments
Add Comment
1 min read
Introduction to Messaging Systems with Kafka
Yasmine Cherif
Yasmine Cherif
Yasmine Cherif
Follow
Nov 28 '24
Introduction to Messaging Systems with Kafka
#
distributedsystems
#
bigdata
#
kafka
#
programming
Comments
Add Comment
16 min read
loading...
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account