We are looking for a skilled professional to design and develop efficient SQL queries while maintaining views, models, and data structures across federated and transactional databases. This role supports analytics and reporting by ensuring seamless data integration and optimized performance. The ideal candidate will have strong expertise in advanced SQL development, complemented by proficiency in Python for data exploration and scripting, as well as shell scripting for lightweight automation tasks. This position requires a deep understanding of database management and query optimization to enhance data accessibility and system efficiency.
Key Responsibilities:
- Develop complex SQL queries for data extraction, transformation, and loading processes.
- Create and maintain views, materialized views, and data models to facilitate reporting and analytics.
- Optimize federated queries and manage efficient joins across multiple database platforms.
- Conduct performance tuning, indexing, and query optimization to improve database responsiveness and efficiency.
Required Qualifications:
- Advanced proficiency in SQL with hands-on experience in MS SQL Server, Oracle DB, PostgreSQL, and columnar databases such as DuckDB.
- Proven experience in federated data access and managing distributed data environments.
- Strong knowledge of Apache Arrow columnar data format, Flight SQL, and Apache Calcite frameworks.
- Proficient in Python for data exploration, scripting, and automation tasks.
- Experience with shell scripting to automate routine processes and support system maintenance.
Preferred Qualifications and Benefits:
- Familiarity with data modeling concepts, including ER diagrams and schema design best practices.
- Experience with backend reporting tools and layers, such as Power BI datasets.
- Understanding of utility operations and power distribution systems is considered an advantage.
- Exposure to cloud-hosted databases and data lakes within cloud ecosystems is a plus.
- Optional knowledge of Grid CIM (Common Information Model; IEC 61970, IEC 61968), GE ADMS DNOM (Distribution Network Object Model), and GE GridOS Data Fabric is beneficial.
This role offers the chance to work with advanced data technologies in a dynamic environment focused on improving data performance and accessibility across complex, multi-database systems. Candidates will contribute to innovative solutions that support critical analytics and reporting functions.