Articles

Generative AI (ChatGPT and image generation) for architectural and civil engineering domains

Generative AI (Generative AI), such as ChatGPT (interactive AI) and Midjourney (image generation), has been the topic of much discussion and is expected to revolutionize society and industries.


What is Generative AI?


Definition of Generative AI

Since generative AI is in its infancy, definitions of the term differ slightly.


Generative AI refers to a category of AI that uses systems called neural networks to analyze data, find patterns, and use those patterns to generate or create new output, such as text, photos, video, code, or data.

Microsoft (Microsoft News Center, 2023)


Generative AI refers to deep learning models that can generate high-quality text, images, and other content based on data that has been trained.

IBM (Martineau, 2023)


Generative AI refers to algorithms (such as ChatGPT) that can be used to create new content such as voice, code, images, text, simulations, and video.

McKinsey & Company (McKinsey & Company, 2023 )


In summary.


  • Deep learning is used to
  • Based on trained data
  • Generate new content based on learned data


The key point of the definition of generative AI is that it uses deep learning to generate new content based on learned data.


History of Generative AI

There are two main streams of development in generative AI: natural language processing and computer vision.


  • In natural language processing, it started with Hidden Markov Models (HMMs) (Knill & Young, 1997)and Gaussian Mixture Models (GMMs) (Reynolds & Others, 2009), which generated continuous data such as speech and time series; since the advent of deep learning, the performance of generative models has improved significantly, with Recurrent neural networks (RNN) being introduced for language modeling tasks and being able to model relatively long dependency relations; after the Transformer architecture (Vaswani et al., 2017) was developed, this model has been the main backbone of various subsequent generative models, including GPT, to the present.
  • コンピュータビジョンでは、Texture synthesis (Efros & Leung, 1999) and Texture mapping (Heckbert, 1986) から始まり、これらのアルゴリズムは、手作業で設計された特徴量に基づき、複雑で多様な画像を生成するというものでしたが、大量に生成する能力には限界がありました。2014年には、Generative Adversarial Networks (GANs) (Goodfellow et al., 2020)やVariational Autoencoders (VAEs)など、拡散生成モデル(Song & Ermon, 2019) などの手法も開発され、画像生成プロセスをよりきめ細かく制御し、高品質な画像を生成することができるようになり、その後、Transformer architecture (Vaswani et al., 2017)をコンピュータービジョンと組み合わせることでこの概念をさらに進め、画像ベースの出力を可能にしました。そしてこれによって、マルチモーダルなタスクを実現することも可能にしました。


In the future, we can expect to see more accurate generation and multimodalization, where the types and combinations of inputs and outputs will become more diverse.


Types of Generative AI

The major generative AI models that have emerged in recent years are classified according to the form of input and generation as shown in the figure below (Gozalo-Brizuela & Garrido-Merchan, 2023).

The input is often text, as it is based on theTransformer architecture (Vaswani et al., 2017), which is specialized for natural language processing ; the generated formats include not only text and images, but also 3D, video, audio, program code, etc. Flaming and VisualGPT have images as input format.

Classification of the main generative AI models (Gozalo-Brizuela & Garrido-Merchan, 2023)


How Generative AI is Used in Architecture and Civil Engineering

This section describes how generative AI can be used in the architectural and civil engineering domain, beginning with an overview of how AI has been used in the architectural and civil engineering domain, followed by a description of the potential applications of generative AI.


Examples of AI Applications to Date

Planning and Design Process

  • Design generation and emulation (Huang & Zheng, 2018; Liu et al, 2017)
  • Design evaluation (Lorenz et al, 2018; Y. Zhang et al, 2018)
  • Prediction of structural response and performance, interpretation of experimental data, and pattern recognition of structural health monitor data in structural design (Sun et al, 2021)
  • Create BIM models from laser scanner data (Tang et al, 2010)
  • Analyze BIM logs and investigate ways to improve design productivity (Pan & Zhang, 2020)


Automated facility design service (PillarPlus)

https://pillarplus.com/ai-tech/


Construction process


A service that installs cameras on construction equipment to avoid collisions with workers (ViAct.ai)

https://www.viact.ai/vimac


A service that detects high-risk accident scenes from cameras at construction sites (mign)

https://prtimes.jp/main/html/rd/p/000000018.000100410.html


Operation and Maintenance Processes

  • Automatic detection and assessment of defects and damage (cracks, delamination, corrosion, holes, joint damage, etc.) in various types of social infrastructure, such as buildings, bridges, tunnels, roads, sewer pipes , etc. (C. Zhang et al., 2020)
  • Detect and diagnose building energy system (e.g. HVAC) failures (Y. Zhao et al, 2019)
  • Predict building age(Tooke et al, 2014)
  • Electric load forecasting for smart grids (Raza & Khosravi, 2015)


How Generative AI is Used

Based on how AI has been used in the building and civil engineering domain to date, as described above, this section describes how generative AI can be used.


Planning and design process


Interview phase

  • Summarize and abstract interviews with clients and government into text data.
  • Automatically output estimates based on interviews with clients.


Literature Review Phase

  • Pick up important information from requirement documents of competitive bidding projects.
  • A dialogue model tuned to construction projects allows the AI to ask questions about zoning and other regulations in each area.


Design phase

  • Automatically outputs design images based on interviews with the client.
  • Displays design images and generates multiple related images.
  • Generate 3D models and import them into BIM/CIM.


Automatic architectural perspective generation tool (mign)

https://prtimes.jp/main/html/rd/p/000000013.000100410.html


Construction Process


Procurement phase

  • New quotations and purchase orders can be created based on information from previous quotations and purchase orders.


Construction phase

  • Interactive model tuned to construction work, allowing users to ask questions if they have any doubts about how to proceed with construction.


Interactive AI (mign) trained on construction-related regulations

https://prtimes.jp/main/html/rd/p/000000020.000100410.html


Operation and maintenance management process


Monitoring phase

  • Input images of the structure and air conditioning, and output a failure diagnosis (FDD) report


Maintenance phase

  • Input images and videos of deteriorated parts of the building or structure and automatically generate a maintenance report.




Summary

We have reviewed the history and types of generative AI, examples of AI applications to date, and discussed how generative AI can be used, essentially replacing the advanced skills and knowledge of humans between input and output.


References

Alavipour, S. M. R., & Arditi, D. (2019). time-cost tradeoff analysis with minimized project financing costs. Automation in Construction, 98, 110-121.

Efros, A. A., & Leung, T. K. (1999). Texture synthesis by non-parametric sampling. Proceedings of the Seventh IEEE International Conference on Computer Vision, 2, 1033-1038 vol. 2.

Fang, W., Love, P. E. D., Luo, H., & Ding, L. (2020). Computer vision for behaviour-based safety in construction: A review and future directions. Advanced Engineering Informatics, 43, 100980.

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., & Bengio, Y. (2020). Generative adversarial networks. Communications of the ACM, 63 (11), 139-144.

Gozalo-Brizuela, R., & Garrido-Merchan, E. C. (2023). ChatGPT is not all you need. a State of the Art Review of large Generative AI models. In arXiv [cs.LG]. arXiv. http://arxiv.org/abs/2301.04655

Hatami Mohsen, Flood Ian, Franz Bryan, & Zhang Xun.(2019). a State-of-the-Art Review on the Applicability of AI Methods to Automated Construction Manufacturing. Journal of Computing in Civil Engineering, 368-375.

Heckbert, P. S. (1986). survey of Texture Mapping. IEEE Computer Graphics and Applications, 6 (11), 56-67.

Himeur, Y., Ghanem, K., Alsalemi, A., Bensaali, F., & Amira, A. (2021). Artificial intelligence based anomaly detection of energy consumption in buildings: A review, current trends and new perspectives. Applied Energy, 287, 116601.

Knill, K., & Young, S. (1997). Hidden Markov Models in Speech and Language Processing. In S. Young & G. Bloothooft (Eds.), Corpus-Based Methods in Language and Speech Processing (pp. 27-68). Springer Netherlands.

Kuritcyn, P., Anding, K., Linß, E., & Latyev, S. M. (2015). Increasing the safety in recycling of construction and demolition waste by using supervised machine learning. Journal of Physics. Conference Series, 588, 012035.

Lorenz, C.-L., Packianather, M., Spaeth, A., & Bleil De Souza, C. (2018). artificial neural network-based modelling for daylight evaluations. SimAUD 2018, Delft, The Netherlands . https://orca.cardiff.ac.uk/id/eprint/112040/

Martineau, K. (2023, April 20). What is generative AI? IBM Research Blog; IBM. https://research.ibm.com/blog/what-is-generative-AI

McKinsey & Company.(2023, January 19). What is generative AI? McKinsey & Company. https://www.mckinsey.com/featured-insights/mckinsey-explainers/what-is-generative-ai

Microsoft News Center.(2023, April 4). AI explained. Stories. https://news.microsoft.com/2023/04/04/ai-explained/

Pan, Y., & Zhang, L. (2020). BIM log mining: Exploring design productivity characteristics. Automation in Construction, 109, 102997.

Reynolds, D. A., & Others. (2009). Gaussian mixture models. Encyclopedia of Biometrics, 741 (659-663). http://leap.ee.iisc.ac.in/sriram/teaching/MLSP_16/refs/GMM_Tutorial_Reynolds.pdf

Song, Y., & Ermon, S. (2019). Generative modeling by estimating gradients of data distribution. Advances in Neural Information Processing Systems, 32. https://proceedings.neurips.cc/paper/2019/hash/3001ef257407d5a371a96dcd947c7d93-Abstract.html

Tang, P., Huber, D., Akinci, B., Lipman, R., & Lytle, A. (2010). Automatic reconstruction of as-built building information models from laser-scanned point clouds: A review of related techniques. Automation in Construction, 19 (7), 829-843.

Tian, J., Gao, M., & Zhou, S. (2009). The Research of Building Logistics Cost Forecast Based on Regression Support Vector Machine. 2009 International Conference on Computational Intelligence and Security, 1, 648-652.

Tooke, T. R., Coops, N. C., & Webster, J. (2014). Predicting building ages from LiDAR data with random forests for building energy modeling. Energy and Buildings, 68, 603-610.

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., & Polosukhin, I. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30. https://proceedings.neurips.cc/paper/7181-attention-is-all

Wang, Z., & Srinivasan, R. S. (2017). A review of artificial intelligence based building energy use prediction: contrasting the capabilities of single and ensemble prediction models. Renewable and Sustainable Energy Reviews, 75, 796-808.

Zhang, C., Chang, C.-C., & Jamshidi, M. (2020). Concrete bridge surface damage detection using a single-stage detector. Computer-Aided Civil and Infrastructure Engineering, 35 (4), 389-409.

Zhang, Y., Burton, H. V., Sun, H., & Shokrabadi, M. (2018). a machine learning framework for assessing post-earthquake structural safety. Structural Safety, 72, 1-16.


Back to articles