Things I tried as team lead: #1

[1] Lack of motivation within the team
[2] Collaboration with external teams
[3] US delivery management without supervision/support
[4] Pre mortem goals
Build architecture patterns for the cloud
More automation around Infrastructure Provisioning
Roll-out Well Architected framework for - AWS, Kubernetes
Increased collaboration with other cloud teams
Evolving Platform Team concept - Increased collaboration with Platform Tech Leads
Building self-sufficient team
[5] External Comms
[6] Individual feedback
[7] Support tasks
[8] Exposure to team around - Support, Debugging, Prod Release
[9] Rotation of responsiblities
[10] PR's

Answer 17 · 2022-03-28T21:25:18.000Z

https://every.to/superorganizers/managing-your-manager

Answer 18 · 2022-03-28T21:43:11.000Z

https://www.mckinsey.com/business-functions/people-and-organizational-performance/our-insights/if-were-all-so-busy-why-isnt-anything-getting-done

Answer 19 · 2022-04-03T10:10:52.000Z

https://threadreaderapp.com/thread/1510285946285178887.html

Answer 20 · 2022-04-04T12:36:58.000Z

https://swizec.com/blog/why-senior-engineers-get-nothing-done/

Answer 21 · 2022-04-04T20:53:42.000Z

https://www.johnwhiles.com/posts/work.html

Answer 22 · 2022-04-04T20:55:32.000Z

https://www.actitime.com/productivity/developer-productivity-tips

Answer 23 · 2022-04-04T20:59:37.000Z

https://incident.io/blog/do-i-need-an-incident-debrief

Answer 24 · 2022-04-06T15:49:06.000Z

https://blog.pragmaticengineer.com/performance-reviews-for-software-engineers/

Answer 25 · 2022-04-07T08:45:20.000Z

https://dev.to/aws-heroes/architecture-devops-and-delivery-teams-need-to-think-differently-to-enable-serverless-12b1

Answer 26 · 2022-04-10T13:12:22.000Z

https://blog.getambassador.io/is-platform-engineering-the-new-devops-or-sre-472ed97a1885

Answer 27 · 2022-04-20T22:05:38.000Z

https://erik.wiffin.com/posts/how-to-get-the-most-out-of-your-11s

Answer 28 · 2022-04-26T08:01:04.000Z

https://www.fastcompany.com/90719658/dont-quit-yet-this-is-how-to-make-your-current-job-fit-your-needs

Answer 29 · 2022-05-09T21:58:19.000Z

https://www.freecodecamp.org/news/create-a-habit-system-and-stay-motivated-as-a-developer/

Answer 30 · 2022-05-19T23:58:07.000Z

https://developers.google.com/tech-writing

Answer 31 · 2022-05-20T20:54:41.000Z

https://mfdela.medium.com/in-defence-of-tech-debts-in-software-engineering-209685ae0e27

Answer 32 · 2022-05-25T16:59:36.000Z

https://blog.xendit.engineer/scaling-infrastructure-as-code-culture-in-xendit-6e84056ca617

Answer 33 · 2022-05-31T23:49:21.000Z

Teams to be trained on the technology they will be working on
Teams to have at least one person who is an SME on topics e.g. Spark or have SME group for each topic/platform teams who will fill this need
Dedicated team/guild/architect analysing and setting up standards/recommendations to be followed
TPO’s working with business to have adequate timeline for iterative delivery to meet production standards
Tech Leads pushing back on TPO to get adequate timeline for iterative delivery to meet production standards
Not just trying to deliver fast to have nice metrics
Not forcing teams with tight deadlines and moving to another work item without considering the fact that the previous work item needs to be maintained by someone on a recurring basis, whether it’s fully productionized etc.

Also, things to be considered for cloud migration –

Someone has to manage the infrastructure in the cloud – moving to cloud doesn’t mean AWS will take care of everything
Adequate funding/training/skillset needs to be given so that resources can stabilize the platform and avoid major P1 incidents down the line in terms of
Having fine grained access controls in place
Auditing
Monitoring/Alerting
Well architected design and many more
Above shouldn’t require a justification – it needs to be quite obvious

Answer 34 · 2022-06-01T08:49:51.000Z

https://www.linkedin.com/pulse/great-employees-dont-complain-walk-away-ian-daley

Answer 35 · 2022-06-01T11:28:42.000Z

https://serverlessfirst.com/emails/quantifying-the-cost-of-introducing-extra-moving-parts-into-your-serverless-architecture/

Answer 36 · 2022-06-02T13:20:59.000Z

Aspect	Standalone Application	Capability on Platform
Architecture and Security Reviews	Get proposed arch and infrastructure patterns reviewed and approved by Security Team	Adopt the existing security patterns and go for security review in case of any major arch changes
Infrastructure	Build new environments from the ground up	Leverage existing and provision additional resources as necessary
Implementation Strategy (CI/CD, Code, Test, etc.)	Set it up as you see fit for team. Can adopt patterns from the existing teams as well.	Embrace the practices from the platform engineering team and add additional ones if necessary
Code Starter Kits:	Build one news and/or adopt and customize existing ones from other teams	Adopt existing ones from platform teams and customize components if necessary
Community Support	Team will get up to speed on the UHG ecosystem, procedures and controls but can get additional support as needed	Help on offer from Platform team. Can't expect Platform team to support us in every step of the way though!
UHC.OPEN/Code reusability	Isolated applications have limited opportunities for inner source. Right mindset can still get things done.	Platform Engineering opens up synergies for collaboration and reuse.
Technology Upgrades	No dependency on other teams to pilot things. Free to execute disruptive things as long as business is not impacted	Collaborate with Platform team on new implementations. Need to take the entire platform ecosystem into consideration for impact!
Cost	It depends on what we can reuse? Should also take time invested by team in learning and building everything from the ground up	Reusability plays a big role in cost numbers
Speed to Market	It depends on what we can reuse? Should also take time invested by team in learning and building everything from the ground up	Reusability plays a big role in speed to market
Enterprise Direction	Isolated applications are still in use, depends on the use case and timeline.	Platform Engineering is the new normal!

Answer 37 · 2022-06-14T15:54:24.000Z

https://martinfowler.com/articles/product-backlog-building-canvas.html

Answer 38 · 2022-06-21T17:36:03.000Z

https://www.lennysnewsletter.com/p/my-favorite-templates-issue-37?s=r

Answer 39 · 2022-06-21T17:37:30.000Z

https://blog.pragmaticengineer.com/scaling-engineering-teams-via-writing-things-down-rfcs/

Answer 40 · 2022-06-21T17:38:31.000Z

https://www.industrialempathy.com/posts/design-docs-at-google/

Answer 41 · 2022-06-22T03:11:41.000Z

https://sharedphysics.com/everything-is-important/

Answer 42 · 2022-06-27T16:36:44.000Z

https://sheeri.org/its-the-little-things/?

Answer 43 · 2022-06-30T14:47:58.000Z

https://threadreaderapp.com/thread/1542061516912037890.html

Questions to ask to determine whether something is really urgent

Answer 44 · 2022-07-14T00:43:09.000Z

HarshadRanganathan commented 2 years ago

Answer 45 · 2022-07-14T23:21:49.000Z

HarshadRanganathan commented 2 years ago

Answer 46 · 2022-07-16T11:43:05.000Z

https://www.infoq.com/podcasts/engineering-empathy

Answer 47 · 2022-07-18T17:37:37.000Z

https://untools.co/

Answer 48 · 2022-07-18T17:39:18.000Z

https://www.navalmanack.com/almanack-of-naval-ravikant/table-of-contents

Answer 49 · 2022-07-18T17:40:45.000Z

https://fs.blog/

Answer 50 · 2022-07-21T21:43:35.000Z

https://medium.com/walmartglobaltech/building-a-platform-team-d915221d5654

Answer 51 · 2022-07-22T19:05:25.000Z

https://rework.withgoogle.com/

Answer 52 · 2022-07-25T13:10:23.000Z

https://slite.com/blog/micromanagement-is-not-a-bad-word

Answer 53 · 2022-07-26T10:50:52.000Z

HarshadRanganathan commented 2 years ago

Answer 54 · 2022-07-30T12:03:19.000Z

https://www.infoq.com/articles/devops-governance-developer-velocity/

Answer 55 · 2022-07-31T19:21:16.000Z

https://www.theengineeringmanager.com/qa/how-do-i-get-better-at-giving-feedback

Answer 56 · 2022-08-01T15:30:22.000Z

https://blog.gruntwork.io/cloud-adoption-fails-65295aff30cc

Answer 57 · 2022-08-02T17:53:08.000Z

https://blog.pragmaticengineer.com/oncall-compensation/

A: Oncall for software engineers is additional.

“Being oncall is your one and only job.”
“It’s not part of the job outside business hours.
“It’s not part of the job outside business hours, but we might still try to reach you during those times.”
“It’s part of the job for all software engineers and we operate in regions which regulate how it needs to be compensated with pay and time off.”
“It’s part of the job, but we recognize the disruption with pay and additional time off.”
“It’s voluntary for most people, and we encourage it with pay and time off.”

B: Oncall is part of the job:

“It’s part of the job for all software engineers and not paid additional.”

Being oncall can be quite disruptive in two major ways:

It disrupts your personal plans, outside of work.
It disrupts your sleep.

Compensation approaches

Flat rate per week or per day of being oncall
Flat rate for standby, plus pay for hours worked outside core hours
Only pay for incidents worked on out-of-hours

Several engineers working at the company told me oncall operational load is high, teams are understaffed, oncall is not paid, and someone even used the term “oncall prison,” as quoted above.

What is the reason for the high oncall lead?

Growing too fast
Too many custom systems
Attrition for experienced people
No backfills
A barely acknowledged tech debt problem
Light at the end of the tunnel

Why are poor oncall practices painful?

They can directly impact software engineer attrition and wellbeing. Simply put, poor oncall practices will lead to more engineers quitting, more people getting burnt out and fewer people recommending a company.

Takeaways

Oncall for software engineers is part of the job. Many companies operate like this, most notably Big Tech – save for Google – and many high-growth startups. The more an employer compensates software engineers, the more likely they expect oncall to be a given.
Oncall for software engineers is additional. Companies which care either about healthy oncall practices or want to minimize attrition for software engineers, make it clear oncall is additional and offer some sort of compensation. Compensation may be cash, or it could be time, or it could be lightening the load with dedicated SREs or DevOps people, or making the rotations voluntary.

Answer 58 · 2022-08-07T12:35:17.000Z

https://www.docker.com/blog/building-stronger-happier-engineering-teams-with-team-topologies/

Answer 59 · 2022-08-09T13:50:41.000Z

https://lucasfcosta.com/2022/08/07/how-to-improve-daily-standups.html

Answer 60 · 2022-08-09T17:07:40.000Z

https://sifted.eu/articles/how-to-upskill-engineers/

Answer 61 · 2022-08-09T20:28:55.000Z

https://architectelevator.com/cloud/serverless-design-patterns/

Answer 62 · 2022-08-12T21:52:51.000Z

https://specbranch.com/posts/one-big-server/

Answer 63 · 2022-08-12T21:54:09.000Z

https://holub.com/kpis-velocity-and-other-destructive-metrics/

Answer 64 · 2022-08-12T22:07:52.000Z

https://longform.asmartbear.com/posts/extreme-questions/

Answer 65 · 2022-08-12T22:17:54.000Z

https://www.nutshell.com/blog/accidental-complexity-software-design

Answer 66 · 2022-08-18T22:38:56.000Z

https://medium.com/wise-engineering/platform-engineering-kpis-6a3215f0ee14

Answer 67 · 2022-08-18T23:30:58.000Z

https://blog.ceejbot.com/posts/reduce-friction/

Answer 68 · 2022-08-18T23:31:43.000Z

https://brooker.co.za/blog/2020/10/19/big-changes.html

Answer 69 · 2022-08-18T23:32:29.000Z

https://talktotheduck.dev/external-debugging-tools-3-jmxterm

Answer 70 · 2022-08-18T23:35:17.000Z

https://zwischenzugs.com/2022/08/08/who-should-write-the-terraform/

Answer 71 · 2022-08-23T22:59:11.000Z

https://www.alexdebrie.com/posts/serverless-framework-vs-cdk/

Answer 72 · 2022-08-30T18:48:05.000Z

https://www.priconceptions.com/notebook/remote-jobs-bad

Answer 73 · 2022-09-04T19:24:29.000Z

HarshadRanganathan commented 2 years ago

Answer 74 · 2022-09-14T22:26:47.000Z

HarshadRanganathan commented 2 years ago

Answer 75 · 2022-09-14T22:27:06.000Z

HarshadRanganathan commented 2 years ago

Answer 76 · 2022-09-15T11:29:24.000Z

HarshadRanganathan commented 2 years ago

Answer 77 · 2022-09-30T10:21:58.000Z

	Ideal State	Current State	Problem Root Cause	Potential Solutions
Guild Meetings	Occurs as planned and gets cancelled occasionally due to overlaps with any other important meetings such as Town Hall	Sometimes guilds get cancelled in successive weeks without reason	Guild Owners are busy with their project delivery/other reasons/lack of agenda	Maybe Guilds should be run by a group of team members rather than one person/replacements
Agenda	Agenda is either well planned in advance or topics are gathered & discussed on the fly (lean meetings)	Sometimes no agenda/lack of audience/no active engagement from everyone	Guild Owners are busy with their project delivery/other reasons so no agenda sometimes Very less participation in the guilds due to lack of awareness/culture/tight delivery deadlines/time zone etc.	Below suggestion from Kevin Guild cohort should reach out to teams and build out an agenda based on their pain points/invite them to share their pain points/try different approaches and see what works
Purpose	Stays on track with the Guild’s mission/goal statements	Sometimes goes off-track discussing items which aren’t relevant to the purpose of that particular guild	Guild Owners are passionate to discuss about what interests them/lack of existing space to discuss things (busy calendar for everyone) so Guild is used as the place to discuss anything	Rename Guilds to make them more generic/send agenda in advance so only interested/relevant folks join/stay focused on the goals
Outcomes	Outcomes are assigned to Team members/Teams Outcomes are tracked Achieved outcomes are demoed	Some items no owner is identifiable	Lack of interest/lack of clarity as to who should own/deliver on it	Assign items to Platform teams backlog/interested teams/individuals – give them the needed support (moral + funding + time) as well
Participation	At least teams/members relevant to the guild participate	Siloed conversations e.g. A team actively developing an API in our platform doesn’t participate in the guild e.g. A team has already built a solution for a problem that the guild is actively discussing about	Lack of participation due to less awareness/culture/tight delivery deadlines/time zone etc. Lack of targeted communication	Project leads/TLs should encourage their teams to participate Guild should ensure relevant scum teams are part of the conversation

Answer 78 · 2022-10-27T17:31:26.000Z

Meetings:

Everyone spoke and gave their opinions and as importantly were heard
Everyone was interested and passionate about the subject
Everyone had genuine positive intentions to make things better and solve problems
Meeting notes (or a transcript) were captured
Actionable items and clear takeaways were captured (with folks volunteering to take ownership of those takeaways)
The meeting started on time and ended on time
The phrase “that doesn’t make sense” was not used once (I am guilty of that one – have to work on my phrasing)
At least two “dumb” questions were asked

Answer 79 · 2022-10-31T19:05:07.000Z

HarshadRanganathan commented 2 years ago

Answer 80 · 2022-11-25T13:48:06.000Z

HarshadRanganathan commented 2 years ago

Answer 81 · 2022-12-06T22:54:18.000Z

HarshadRanganathan commented 2 years ago

Answer 82 · 2022-12-07T13:11:34.000Z

HarshadRanganathan commented 2 years ago