Data Engineering

8.5K members Est. May 6, 2022 Updated Feb 10, 2026
Sachin Chandrashekhar @MentorSachin · Feb 7
A small win I wanted to share with you 💙
I’ve been featured in Topmate’s “25 in 25 – Data Engineering” list and ranked #5 among data engineering creators in India.
This list recognises mentors and creators who are contributing consistently to learning, mentoring, and https://t.co/g3op0bX12t
Tweet media
0
0
0
185
0
Sachin Chandrashekhar @MentorSachin · Feb 7
Spark memory management is responsible for 70%+ of interview questions.
It's the #1 reason production jobs fail.
Yet most data engineers guess their configs.

The problem starts with understanding overhead.

When you set executor memory, Spark adds overhead:
Overhead = max(384
0
0
0
54
1
Brahmareddy @BrahmaWritings · Feb 6
https://t.co/Zd6rr8CV5Q
0
0
0
237
0
Tanuj @tanujDE3180 · Feb 4
AI feels magical until the data pipeline breaks.

Then it’s just statistics waiting on a fix.

Data engineering is the real backbone.
0
0
0
265
0
Sachin Chandrashekhar @MentorSachin · Feb 4
Krishna worked at Infosys on legacy tech.
Just became a father.

He asked his manager for modern tech.
Offered to train others on current stack.
Got AWS and Snowflake.

Cloud was completely new to him.

He joined my bootcamp.
Finished it while managing new parent
0
0
1
4.4K
0
Sachin Chandrashekhar @MentorSachin · Feb 3
I just published Why Experienced Data Professionals Struggle to Crack Cloud Data Engineering Interviews https://t.co/EZjhcK2tl3
0
0
0
101
0
Sachin Chandrashekhar @MentorSachin · Feb 1
Overwhelmed by too many tools, services, and jargon?

I created a clear, step-by-step roadmap.
The one I personally teach in my bootcamps.

From foundational skills to advanced pipeline design:

Python and SQL fundamentals
Spark, Glue, PySpark
S3, Redshift, Athena, Lake Formation
0
0
1
6.7K
2
Tanuj @tanujDE3180 · Jan 29
Would you prefer

Working for 3 months to build, test, deploy & maintain a data pipeline.

Or

Build an AI agent in 3 hours that would build, test, deploy & maintain a data pipeline.
1
0
2
5.6K
1
Sachin Chandrashekhar @MentorSachin · Jan 29
I wouldn't advise trying data engineering right out of college in India.

There are rare cases where fresh graduates find DE roles.

Data engineering demands expertise, experience,
and deep understanding of complex systems.

It's more than just writing code:

Designing
0
0
1
4.2K
0
Deepak Narayan @deepaknrn · Jan 27
#dezoomcamp
🐳 Module 1 of Data Engineering Zoomcamp done!

- Docker containers
- Postgres & SQL
- Terraform & GCP
- NYC taxi data pipeline

My solution: https://t.co/N3MLMJsK3r
Free course by @DataTalksClub: https://t.co/gn7EfGWrnw
0
0
0
3.6K
1
Tanuj @tanujDE3180 · Jan 26
How did you prepare for Data Engineer interviews?
0
0
0
1.5K
0
Sachin Chandrashekhar @MentorSachin · Jan 26
Git can be confusing for beginners.
But there's not a lot you need to learn to get interview-ready.

I never used Git until 2019.
Used GitHub for a Machine Learning course project.

2020 was when I started AWS work.
Since then, regular Git user.

Master these fundamentals:
0
0
0
774
0
Tanuj @tanujDE3180 · Jan 26
Serious question: what part of your job do you hope AI never touches?
0
0
0
127
0
Sachin Chandrashekhar @MentorSachin · Jan 20
After 17 years, if I were starting data engineering today,
here are the top 6 things I'd focus on:

Data Modeling:
Master creating efficient structures.
Understand relational, dimensional, and NoSQL models.
Know when to choose each.

Data Integration:
Combine different sources
0
0
3
9.9K
7
Sachin Chandrashekhar @MentorSachin · Jan 17
Your data is lying to you.
And it might be costing your company millions.

73% of datasets have nulls.
Average cost of bad data: $3.1M.
Time to fix it in PySpark: 8 minutes.

If you're ignoring missing values, your analytics are wrong.

Detect nulls: https://t.co/R3VNOZrKV8
Tweet media
1
0
1
5.7K
1
Sachin Chandrashekhar @MentorSachin · Jan 16
"My Spark job failed but I had 8 GB executor memory."
Ever said this yourself?

Let me tell you a hard truth:

Spark memory management is responsible for 70%+ of interview questions.
It's also the #1 reason production jobs fail.
Yet most data engineers guess their memory configs. https://t.co/RFxp3bPeMN
Tweet media
0
1
2
3.3K
1
Sachin Chandrashekhar @MentorSachin · Jan 15
Most candidates learn Spark DataFrames and Spark SQL.

Then the interview question hits:
"How do you optimize Spark jobs in production?"

That's when memorized answers fail.

Here's what actually matters:

Use Parquet for columnar reads.
Partition data if downstream consumers
1
1
1
2.8K
0
Sachin Chandrashekhar @MentorSachin · Jan 13
Here’s the hard truth: Companies pay a premium for engineers who understand data warehousing.
If you’ve only built pipelines or done ETL transformations — you’re missing a key piece of the puzzle.

Skills in demand:

Dimensional modeling & star/snowflake schemas
Partitioning,
0
0
1
2.7K
1
Sachin Chandrashekhar @MentorSachin · Jan 12
Data warehousing is here to stay.

When I joined Cognizant in 2015, there was an entire business unit called:
Enterprise Information Management (formerly DWBI—Data Warehousing & BI).

One BU. Just for data warehousing.

That's how crucial this foundation is.
And with Amazon
0
0
2
3.0K
2
Sachin Chandrashekhar @MentorSachin · Jan 9
Batch vs Streaming isn't about technology preference.
It's about business requirements vs operational complexity.

Streaming sounds exciting. Kafka, Spark Streaming, real-time dashboards.
But streaming means: always-on infrastructure, complex failure handling, and 3 AM debugging.
0
0
2
5.8K
0

Sachin Chandrashekhar

@MentorSachin

50K LinkedIn|On a mission to get Data professionals AWS Data Engineering Jobs! | Founder - Data Engineering Hub | Lead Data Engineer @ World’s #1 Airline

358 Followers
14 Contributions

Tanuj

@tanujDE3180

Principal Software Engineer | Building Data & AI systems at scale | Career, pay, architecture & hard lessons | Travel & lifestyle occasionally ✈️

290 Followers
4 Contributions

Brahmareddy

@BrahmaWritings

Data Engineer | Building & researching Data, AI & ML | Sharing real experiences | USA 🇮🇳🇺🇸

288 Followers
1 Contributions

Deepak Narayan

@deepaknrn

🇮🇳 & 🇭🇲 / IT Professional / Films 🎥 , Sports 🏏 & Music 🎶 / #ThalapathyVijay @actorvijay ❤️ @tvkvijayhq

3.5K Followers
1 Contributions
8.5K
Total Members
1
24h Growth
1
7d Growth
Date Members Change
Feb 10, 2026 8.5K -1
Feb 9, 2026 8.5K +0
Feb 8, 2026 8.5K +0
Feb 7, 2026 8.5K +0
Feb 6, 2026 8.5K +0
Feb 5, 2026 8.5K +0
Feb 4, 2026 8.5K +0
Feb 3, 2026 8.5K +0
Feb 2, 2026 8.5K +0
Feb 1, 2026 8.5K +0
Jan 31, 2026 8.5K +0
Jan 30, 2026 8.5K +0
Jan 29, 2026 8.5K -1
Jan 28, 2026 8.5K

No reviews yet

Be the first to share your experience!

Talk about data engineering and other data-related topics

Community Rules

Be kind and respectful.
Keep Tweets on topic.
Explore and share.
No spam. Keep self-promotion to one tweet per week

No promotions from companies

Don't repost your tweets here

We know you want people to see your tweets. But this community is not the right place for that