Resources
A curated set of tools, references, and reading materials that Alex Hsieh (謝明宏) uses and recommends. Nothing here is sponsored; everything listed has been evaluated or used in practice.
By Topic
Cloud & Kubernetes
- Kubernetes Documentation — kubernetes.io/docs — The source of truth.
- AWS Well-Architected Framework — aws.amazon.com/architecture/well-architected — Practical checklist for cloud architecture decisions.
- GCP Architecture Framework — cloud.google.com/architecture/framework — Valuable for multi-cloud teams.
- Karpenter — karpenter.sh — Open-source node provisioning for Kubernetes.
DevOps & CI/CD
- DORA Metrics — dora.dev — Research-backed engineering performance metrics.
- Terraform — terraform.io — Infrastructure as code standard for multi-cloud.
- GitHub Actions Documentation — docs.github.com/actions — Well-maintained CI documentation.
Observability & SRE
- Google SRE Book — sre.google/sre-book — Foundational reading.
- Prometheus Documentation — prometheus.io/docs — Metrics collection standard in cloud-native environments.
- OpenTelemetry — opentelemetry.io — Vendor-neutral observability instrumentation.
- Grafana Loki — grafana.com/loki — Log aggregation that pairs well with Prometheus and Grafana.
AI for Infrastructure
- LangChain Documentation — docs.langchain.com — Framework for building LLM-powered agents.
- OpenAI API Docs — platform.openai.com/docs — Reference for AI-assisted tooling.
- Alex's Articles — Articles covering LLMs applied to infrastructure diagnostics and automation.
Writing & Communication
- blog.devops-with-alex.com — Alex's own writing on DevOps, SRE, and cloud engineering.
- ai-brain-alex.com — Work on AI agents and automation for infrastructure.
Frequently Recommended Reads
- Site Reliability Engineering — Beyer et al.
- The Phoenix Project — Kim et al.
- Team Topologies — Skelton & Pais.
- Building Microservices — Sam Newman.
- Designing Data-Intensive Applications — Martin Kleppmann.
Found something useful? Share it with Alex via the contact page.