Base configuration - cloud-init 25.1.2 documentation (original) (raw)
Toggle table of contents sidebar
Warning
This documentation is intended for custom image creators, such as distros and cloud providers, not end users. Modifying the base configuration should not be necessary for end users and can result in a system that may be unreachable or may no longer boot.
Cloud-init
base config is primarily defined in two places:
/etc/cloud/cloud.cfg
/etc/cloud/cloud.cfg.d/*.cfg
See the configuration sources explanation for more information on how these files get sourced and combined with other configuration.
Generation¶
cloud.cfg
isn’t present in any of cloud-init
’s source files. Theconfiguration is templated and customized for each distribution supported by cloud-init
.
Base configuration keys¶
Module keys¶
Modules are grouped into the following keys:
cloud_init_modules
: Modules run during networktimeframe.cloud_config_modules
: Modules run during configtimeframe.cloud_final_modules
: Modules run during finaltimeframe.
Each module
definition contains an array of strings, where each string is the name of the module. Each name is taken directly from the module filename, with the cc_
prefix and .py
suffix removed, and with -
and _
being interchangeable.
Alternatively, in place of the module name, an array of<name>, <frequency>[, <args>]
args may be specified. Seethe module creation guidelines for more information on frequency
and args
.
Note
Most modules won’t run at all if they’re not triggered via a respective user-data key, so removing modules or changing the run frequency is not a recommended way to reduce instance boot time.
Examples¶
To specify that only cc_final_message.py run during final timeframe:
cloud_final_modules:
- final_message
To change the frequency from the default of ALWAYS
to ONCE
:
cloud_final_modules:
- [final_message, once]
To include default arguments to the module (that may be overridden by user-data):
cloud_final_modules:
- [final_message, once, "my final message"]
Datasource keys¶
Many datasources allow configuration of the datasource for use in querying the datasource for meta-data using the datasource
key. This configuration is datasource dependent and can be found under each datasource’s respective documentation. It will generally take the form of:
datasource: : ...
System info keys¶
These keys are used for setup of cloud-init
itself, or the datasource or distro. Anything under system_info
cannot be overridden by vendor-data, user-data, or any other handlers or transforms. In some cases there may be asystem_info
key used for the distro, while the same key is used outside ofsystem_info
for a user-data module. Both keys will be processed independently.
system_info
: Top-level key.paths
: Definitions of common paths used bycloud-init
.
*cloud_dir
: Default:/var/lib/cloud
.
*templates_dir
: Default:/etc/cloud/templates
.distro
: Name of distro being used.default_user
: Defines the default user for the system using the same user configuration as Users and Groups. Note that this CAN be overridden if ausers
configuration is specified without a- default
entry.ntp_client
: The default NTP client for the distro. Takes the same form asntp_client
defined in NTP.package_mirrors
: Defines the package mirror info for apt.ssh_svcname
: The SSH service name. For most distros this will be eitherssh
orsshd
.network
: Top-level key for distro-specific networking configuration.
*renderers
: Prioritized list of networking configurations to try on this system. The first valid entry found will be used. Options are:
*eni
: For/etc/network/interfaces
.
*network-manager
*netplan
*networkd
: Forsystemd-networkd
.
*freebsd
*netbsd
*openbsd
*activators
: Prioritized list of networking tools to try to activate network on this system. The first valid entry found will be used. Options are:
*eni
: Forifup
/ifdown
.
*netplan
: Fornetplan generate
/netplan apply
.
*network-manager
: Fornmcli connection load
/nmcli connection up
.
*networkd
: Forip link set up
/ip link set down
.apt_get_command
: Command used to interact with APT repositories. Default:apt-get
.apt_get_upgrade_subcommand
: APT subcommand used to upgrade system. Default:dist-upgrade
.apt_get_wrapper
: Command used to wrap the apt-get command.
*enabled
: Whether to use the specifiedapt_wrapper
command. If set toauto
, use the command if it exists on thePATH
. Default:true
.
*command
: Command used to wrap anyapt-get
calls. Default:eatmydata
.
Logging keys¶
See the logging explanation for a comprehensive logging explanation. Note that cloud-init
has a default logging definition that shouldn’t need to be altered. It is defined in this instance at /etc/cloud/cloud.cfg.d/05_logging.cfg
.
The logging keys used in the base configuration are as follows:
logcfg
¶
A standard python fileConfig formatted log configuration. This is the primary logging configuration key and will take precedence overlog_cfgs
or log_basic
keys.
log_cfgs
¶
A list of logging configs in fileConfig format to apply when running cloud-init
. Note that log_cfgs
is used in/etc/cloud.cfg.d/05_logging.cfg
.
log_basic
¶
Boolean value to determine if cloud-init
should apply a basic default logging configuration if none has been provided. Defaults to true
but only takes effect if logcfg
or log_cfgs
hasn’t been defined.
output
¶
If and how to redirect stdout
/stderr
. Defined in/etc/cloud.cfg.d/05_logging.cfg
and explained inthe logging explanation.
syslog_fix_perms
¶
Takes a list of <owner:group>
strings and will set the owner ofdef_log_file
accordingly.
def_log_file
¶
Only used in conjunction with syslog_fix_perms
. Specifies the filename to be used for setting permissions. Defaults to /var/log/cloud-init.log
.
Other keys¶
network
¶
The network configuration to be applied to this instance.
datasource_pkg_list
¶
Prioritized list of python packages to search when finding a datasource. Automatically includes cloudinit.sources
.
datasource_list
¶
This key contains a prioritized list of datasources that cloud-init
attempts to discover on boot. By default, this is defined in/etc/cloud/cloud.cfg.d
.
There are a few reasons to modify the datasource_list
:
- Override default datasource discovery priority order
- Force cloud-init to use a specific datasource: A single entry in the list (or a single entry and
None
) will override datasource discovery, which will force the specified datasource to run. - Remove known invalid datasources: this might improve boot speed on distros that do not use
ds-identify
to detect and select the datasource,
Warning
This key is unique in that it uses a subset of YAML syntax. It requiresthat the key and its contents, a list, must share a single line - no newlines.
vendor_data
/vendor_data2
¶
Allows the user to disable vendor_data
or vendor_data2
along with providing a prefix for any executed scripts.
Format is a dict with enabled
and prefix
keys:
enabled
: A boolean indicating whether to enable or disable thevendor_data
.prefix
: A path to prepend to anyvendor_data
-provided script.
allow_userdata
¶
A boolean value to disable the use of user-data. This allows custom images to prevent users from accidentally breaking closed appliances. Setting allow_userdata: false
in the configuration will disablecloud-init
from processing user-data.
manual_cache_clean
¶
By default, cloud-init searches for a datasource on every boot. Setting this to true
will disable this behaviour. This is useful if your datasource information will not be present every boot. Default: false
.
Example¶
On an Ubuntu system, /etc/cloud/cloud.cfg
should look similar to:
The top level settings are used as module and base configuration.
A set of users which may be applied and/or used by various modules
when a 'default' entry is found it will reference the 'default_user'
from the distro configuration specified below
users:
- default
If this is set, 'root' will not be able to ssh in and they
will get a message to login instead as the default $user
disable_root: true
This will cause the set+update hostname module to not operate (if true)
preserve_hostname: false
If you use datasource_list array, keep array items in a single line.
If you use multi line array, ds-identify script won't read array items.
Example datasource config
datasource:
Ec2:
metadata_urls: [ 'blah.com' ]
timeout: 5 # (defaults to 50 seconds)
max_wait: 10 # (defaults to 120 seconds)
The modules that run in the 'init' stage
cloud_init_modules:
- seed_random
- bootcmd
- write_files
- growpart
- resizefs
- disk_setup
- mounts
- set_hostname
- update_hostname
- update_etc_hosts
- ca_certs
- rsyslog
- users_groups
- ssh
The modules that run in the 'config' stage
cloud_config_modules:
- wireguard
- snap
- ubuntu_autoinstall
- ssh_import_id
- keyboard
- locale
- set_passwords
- grub_dpkg
- apt_pipelining
- apt_configure
- ubuntu_pro
- ntp
- timezone
- disable_ec2_metadata
- runcmd
- byobu
The modules that run in the 'final' stage
cloud_final_modules:
- package_update_upgrade_install
- fan
- landscape
- lxd
- ubuntu_drivers
- write_files_deferred
- puppet
- chef
- ansible
- mcollective
- salt_minion
- reset_rmc
- scripts_vendor
- scripts_per_once
- scripts_per_boot
- scripts_per_instance
- scripts_user
- ssh_authkey_fingerprints
- keys_to_console
- install_hotplug
- phone_home
- final_message
- power_state_change
System and/or distro specific settings
(not accessible to handlers/transforms)
system_info:
This will affect which distro class gets used
distro: ubuntu
Default user name + that default users groups (if added/used)
default_user: name: ubuntu doas: - permit nopass ubuntu lock_passwd: True gecos: Ubuntu groups: [adm, cdrom, dip, lxd, sudo] sudo: ["ALL=(ALL) NOPASSWD:ALL"] shell: /bin/bash network: dhcp_client_priority: [dhclient, dhcpcd, udhcpc] renderers: ['netplan', 'eni', 'sysconfig'] activators: ['netplan', 'eni', 'network-manager', 'networkd']
Automatically discover the best ntp_client
ntp_client: auto
Other config here will be given to the distro class and/or path classes
paths: cloud_dir: /var/lib/cloud/ templates_dir: /etc/cloud/templates/ package_mirrors: - arches: [i386, amd64] failsafe: primary: http://archive.ubuntu.com/ubuntu security: http://security.ubuntu.com/ubuntu search: primary: - http://%(ec2_region)s.ec2.archive.ubuntu.com/ubuntu/ - http://%(availability_zone)s.clouds.archive.ubuntu.com/ubuntu/ - http://%(region)s.clouds.archive.ubuntu.com/ubuntu/ security: [] - arches: [arm64, armel, armhf] failsafe: primary: http://ports.ubuntu.com/ubuntu-ports security: http://ports.ubuntu.com/ubuntu-ports search: primary: - http://%(ec2_region)s.ec2.ports.ubuntu.com/ubuntu-ports/ - http://%(availability_zone)s.clouds.ports.ubuntu.com/ubuntu-ports/ - http://%(region)s.clouds.ports.ubuntu.com/ubuntu-ports/ security: [] - arches: [default] failsafe: primary: http://ports.ubuntu.com/ubuntu-ports security: http://ports.ubuntu.com/ubuntu-ports ssh_svcname: ssh
configure where output will go
output: init: "> /var/log/my-cloud-init.log" config: [ ">> /tmp/foo.out", "> /tmp/foo.err" ] final: output: "| tee /tmp/final.stdout | tee /tmp/bar.stdout" error: "&1"
Set true
to enable the stop searching for a datasource on boot.
manual_cache_clean: False
def_log_file and syslog_fix_perms work together
if
- logging is set to go to a log file 'L' both with and without syslog
- and 'L' does not exist
- and syslog is configured to write to 'L'
then 'L' will be initially created with root:root ownership (during
cloud-init), and then at cloud-config time (when syslog is available)
the syslog daemon will be unable to write to the file.
to remedy this situation, 'def_log_file' can be set to a filename
and syslog_fix_perms to a string containing ":"
def_log_file: /var/log/my-logging-file.log syslog_fix_perms: syslog:root